Gene GSU2389 details


Gene Information

Locus tag: GSU2389
Symbol:
ID: 2686577
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sulfurreducens PCA
Kingdom: Bacteria
Replicon accession: NC_002939
Strand:
Start bp: 2619301
End bp: 2620449
Gene Length: 1149 bp
Protein Length: 382 aa
Translation table: 11
GC content: 65%
IMG OID: 637127079
Product: ABC transporter, periplasmic substrate-binding protein, putative
Protein accession: NP_953435
Protein GI: 39997484
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component
TIGRFAM ID:
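
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below shows that arithmetic under two assumptions that the record does not state explicitly but that are consistent with its values: coordinates are 1-based and inclusive, and the CDS length includes the stop codon. The function names are hypothetical and only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Protein length for a CDS whose length includes the stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminating stop codon.
    return cds_length_bp // 3 - 1


length = gene_length_bp(2619301, 2620449)
print(length)                     # 1149
print(protein_length_aa(length))  # 382
```

Both results agree with the Gene Length and Protein Length fields of the record.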


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.520842
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTAATCC GGCCCTTCGC GCGGCTTTTC GCCCTGCGTC CGGTTGCGTT GCTCGCCTGC 
GCGGCTGTCA TGGTTTCGAC CATGTCGGGA TGCCGACGCG ACGACTCTTT GAAAATCGGC
TATCTGGGCA CCCTCAGTGG CCGCCATTCC GATCTGGGCG TTGCCGGCCG CGACGGTACC
ATCTTTGCCG TGGAGGAGAT CAACCGGGCG GGCGGCATTA ACGGAAAGCG CCTGGAACTG
GTGATCCGCG ACGACGAGGG CAAGGCCGGG AGCGCCCAGA CTGCCGTGCG CGAGCTGATC
GCGGCCGGCG TCGCCGCCAT CGTCGGACCC ATGACCAGCT CCATGGCCAT GGCAACGGTG
CCCGTCATTA ACGGTTCGCC GGTGGTCATG ATCAGCCCCA CCGTCAGCAC GGGCGATCTC
AGCGGCAAAG ACGACAACTT CCTCCGCATT TATCCCACCA ACGCCCAGAA GACCCGGCAG
CTGGCCGAAT ATGCCCGCAC GAGCCTGGGG CTGTCCCGGC TGGCCGTGGT GTACGACCTT
TCCAACCGCT CCTACACCGA CGACTGGCGG CAGGCGTTCA CCCGGCAGTT CGAGTCGCTG
GGCGGCACCG TCGCGCCGGC GGTCTCCTTT GATGCGTCCG GCCAGACCGA TTACCTGTCC
GTGGCCAGAA CGCTCCTGGC AAAGAAGCCG CAGGGGGTTC TCATCCTGGC CGGTGCCGTT
GATGCCGCCA TGCTCTGCCA GCAGATCCGC AAGCTCGACA CTTCCACGGC CCTCTTCGCC
ACCGAATGGT CCAGCACGCC CGAGCTCCTC ATGCACGGCG GGACCGCGGT GGAGGGGATC
GTTTACTGCC AGAATTTCAT CCGCGATGAC GACTCCACCC CCTATGTGAC CTTCTGCCGG
GCCTTCGAGA CACGTTTCGG CCAGCTTCCC GACTTCGGGG CCGTCTATGC GTACCAGGCG
GTCAAGGTGA CCGGCGCGGG GCTGGCAAAA AATCCCGCGC CCCGGGGGCT GAAGGGGGCG
ATCCTCGGCA CCGGAACCTT TCCCGGTCTT CAGGGCGAGT TTACCATTGA TCGTTTCGGC
GATACCGACC GGAAGCCGTT CCTGATGACC GTGCGCCAGG GAGCGTTCCG GCGGGTGGAG
GTGCCGTGA
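
The gene sequence above is laid out in blocks of 10 bases, six blocks per line. A minimal sketch of how such a layout could be collapsed into one string and checked against the Gene Length and GC content fields is given below. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first and last lines of the sequence are used here as sample input.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_fraction(sequence: str) -> float:
    """Fraction of bases that are G or C."""
    if not sequence:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in sequence if base in "GC")
    return gc_count / len(sequence)


# Sample input: first and last lines of the gene sequence only.
sample = """ATGGTAATCC GGCCCTTCGC GCGGCTTTTC GCCCTGCGTC CGGTTGCGTT GCTCGCCTGC
GTGCCGTGA"""

sequence = clean_sequence(sample)
print(len(sequence))
print(round(gc_fraction(sequence), 2))
```

Run on the full 1149 bp sequence, the same two calls would give values to compare with the Gene Length and GC content fields.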
 
Protein sequence
MVIRPFARLF ALRPVALLAC AAVMVSTMSG CRRDDSLKIG YLGTLSGRHS DLGVAGRDGT 
IFAVEEINRA GGINGKRLEL VIRDDEGKAG SAQTAVRELI AAGVAAIVGP MTSSMAMATV
PVINGSPVVM ISPTVSTGDL SGKDDNFLRI YPTNAQKTRQ LAEYARTSLG LSRLAVVYDL
SNRSYTDDWR QAFTRQFESL GGTVAPAVSF DASGQTDYLS VARTLLAKKP QGVLILAGAV
DAAMLCQQIR KLDTSTALFA TEWSSTPELL MHGGTAVEGI VYCQNFIRDD DSTPYVTFCR
AFETRFGQLP DFGAVYAYQA VKVTGAGLAK NPAPRGLKGA ILGTGTFPGL QGEFTIDRFG
DTDRKPFLMT VRQGAFRRVE VP
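
The protein sequence can be related back to the gene sequence by codon-wise translation. The sketch below builds the standard codon table from the usual NCBI base ordering (T, C, A, G). Translation table 11 uses the same codon assignments for elongation and differs from the standard table mainly in the permitted start codons, which does not matter here because the gene starts with ATG. The function name `translate` is hypothetical, and handling of alternative start codons is deliberately left out.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order for codons TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence.
print(translate("ATGGTAATCC GGCCCTTCGC GCGGCTTTTC"))  # MVIRPFARLF
# Last line of the gene sequence (ends with the stop codon TGA).
print(translate("GTGCCGTGA"))  # VP
```

The two outputs match the first ten and the last two residues of the protein sequence listed above.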