Gene GM21_1647 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_1647 
Symbol 
ID8136978 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp1916241 
End bp1917509 
Gene Length1269 bp 
Protein Length422 aa 
Translation table11 
GC content56% 
IMG OID644869260 
ProductABC transporter related 
Protein accessionYP_003021460 
Protein GI253700271 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1134] ABC-type polysaccharide/polyol phosphate transport system, ATPase component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones193 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATAAGTG TGGAGAACAT CGGGAAACGG TACACCATTT CCCATAGAGG GAGTGACAGC 
AACCACAGTA TCAGCGACAC GGTGGGGGTG AAGCTGTCCC GTTTTTGGGG GAAACTGCTG
CACCCGTTCC GAGCCGGACA GCGGCCGCCC CGGAACAGCG AGGAATTCTG GGCCCTGCGT
GATGTTTCGT TCGAGGTGAA AGAAGGCGAG TGTGTCGGCA TCATCGGCGG CAACGGAGCC
GGGAAATCCA CCCTGCTCAA GATCCTTAGC CGGATCACCG AACCCTCCAC CGGCCGCATC
AGGATCAGAG GGAGGGTTGC AAGCCTGCTT GAGGTGGGAA CGGGTTTTCA CCCTGAACTG
ACCGGGCGTG AGAACGTTTA CTTGAACGGG ACCATGCTTG GGATGACCCG CAGTGAGATC
CGCAGCCGTT TCGACGAGAT CGTTGCATTC TCCGAAGTGG AGAGATTCCT CGATACCCCA
GTGAAGCGGT ACTCCTCCGG CATGTACGTA CGCCTCGCCT TTGCGGTGGC TGCGCACCTT
GAACCTGACA TCCTCATCGT GGACGAAGTC TTGTCGGTGG GGGATGCTCA ATTCCAGAAG
AAGTGCCTTG GGAAAATGGA GGATGTATCA GCGAGGCAGG GCAGAACCGT ACTTTTCGTC
AGCCACAACA TCCCTACCGT GCAAAGTCTC TGCAACCGGG GCATCCTGCT TCAAGGGGGG
CGGGTAGCGT GTCAGGGTGA CATCAGAGGT GTGACTCAAA ACTACCTGCG CGGTTCCCTG
CCCCAGTTAA CTACCGCAGC GGTTGAACTG CCTGGTGAGC ACCTGAAGAG GGTGCGCATC
TGTGATGCCG AAGGTGAGCC CTGCACCCTT TTTCCCATGG GATCCCCCTT CAGGGTTGAG
GTTGACAGCT GTGGTTTGGA TAGGGTTCCC GGTTCTCAGG TTAGCCTCTC CCTAAGGACC
GAGGAAGGGG GACGTATTTT CACCCTCAAC ACCGGTATGA GTTGCCGGTA CTTAGCGCAG
CAAAGGGGAG AACGGGAGAC CTTCATCCTG CAGGTGGACT GTCTGAACCT AGTACCCGGA
CGTTACCTGC TAGAAGTATC CCTAGCCCAA AAGGGTGTGG CGAGGATGGA GCATTACGAG
AATTTTGCCG AAATAACGGT CGTCGAACAT GATGTCTATG GGTCTGGCTA CATTCTCTCT
AGCCATTACG GGCTGGTTTT CTTACAGGGG GGATGGAGCG TCAAGAATCA AGAGGAAGCC
TTCAAGTGA
 
Protein sequence
MISVENIGKR YTISHRGSDS NHSISDTVGV KLSRFWGKLL HPFRAGQRPP RNSEEFWALR 
DVSFEVKEGE CVGIIGGNGA GKSTLLKILS RITEPSTGRI RIRGRVASLL EVGTGFHPEL
TGRENVYLNG TMLGMTRSEI RSRFDEIVAF SEVERFLDTP VKRYSSGMYV RLAFAVAAHL
EPDILIVDEV LSVGDAQFQK KCLGKMEDVS ARQGRTVLFV SHNIPTVQSL CNRGILLQGG
RVACQGDIRG VTQNYLRGSL PQLTTAAVEL PGEHLKRVRI CDAEGEPCTL FPMGSPFRVE
VDSCGLDRVP GSQVSLSLRT EEGGRIFTLN TGMSCRYLAQ QRGERETFIL QVDCLNLVPG
RYLLEVSLAQ KGVARMEHYE NFAEITVVEH DVYGSGYILS SHYGLVFLQG GWSVKNQEEA
FK