Gene GM21_1576 details


Gene Information

Locus tag: GM21_1576
Symbol:
ID: 8136907
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 1836625
End bp: 1837635
Gene Length: 1011 bp
Protein Length: 336 aa
Translation table: 11
GC content: 61%
IMG OID: 644869189
Product: TRAP dicarboxylate transporter, DctP subunit
Protein accession: YP_003021389
Protein GI: 253700200
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1638] TRAP-type C4-dicarboxylate transport system, periplasmic component
TIGRFAM ID: [TIGR00787] tripartite ATP-independent periplasmic transporter solute receptor, DctP family
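
The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state the convention) and that the final codon is a stop codon that contributes no residue. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 1836625
end_bp = 1837635

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# One codon per residue, plus a final stop codon that is not translated.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints: 1011 0 336
```

Under these assumptions the result agrees with the listed gene length of 1011 bp and protein length of 336 aa.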


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 100
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTCTCA AAGCATGTCT GAAAGCATTG GCCATGGCAG CAGCGCTCGC CCTGCCGCTG 
AACGCCGTCG CGGCGCCGGC GCCGATCGTG ATCAAGTTCA GCCACGTCGT GGCGCAGCAC
ACCCCCAAGG GGCAGGCTGC GGACTACTTC AAGAAACTGG CTGAAGAGCG GACCAAGGGA
AGGGTCAAGG TCGAGGTGTA TCCGAACAGC CAGCTCTACA AGGACAAGGA AGAGATGGAA
GCGCTGCAGC TCGGCGCGGT ACAGATGCTG GCGCCTTCCC TCGCCAAGTT CGCGCCGCTG
GGCGTGAAGG AATTCGAGGT CTTCGACCTC CCCTTCATCT TCGACAACTA CCAGGAACTT
CACAAGGTGA CCCAGGGGCC GGTCGGCGCG AAGCTCCTCA AAAAGCTCGA GCGCAAGGGT
ATCCTCGGCC TCGCCTACTG GGACAACGGC TTCAAGGTGA TGAGCGCCAA CAAACCGCTT
AAATCCGTAA ACGACTTCCG CGGTCAGAAG ATGCGCATCC AGTCCTCCAA GGTGCTCGAC
TCCCAGATGC GTTCCGTAGG CGCCATGCCG CAGGTGCTCG CCTTCTCCGA GGTGTACCAG
GCACTGCAGA CCGGCGTCGT CGACGGCACC GAGAACCCGC CGTCCAACCT CTACACCCAG
AAGATGCACG AGGTGCAGAA ATACGTGACC CTCTCCGACC ACGGCTACCT GGGCTACGCC
GTCATCGTCA ATAAGAAGTT CTGGCAGGGA CTGCCGGCCG ACATCCGCAC CATCCTGGAA
GGGTGCATGA AGGACGCGAC CAAGTACGCC AACGACATCG CCAAGAAGGA CAACGAGGAG
GCGCTTGCCG GCGTCAAGAA GTCCGGCCGC AGCCAGTTGA TCAGCCTCAC CCCGCAGGAG
CGCACCGCCT GGAAGAAGGC GATGGACAAG GCGCACAAAA GTAACATGGG GCGCATCGGC
GCCGACATAA TCAAGGAAGT CTACGCGGCC ACAGGCTACA ACCCGAACTA G
 
Protein sequence
MSLKACLKAL AMAAALALPL NAVAAPAPIV IKFSHVVAQH TPKGQAADYF KKLAEERTKG 
RVKVEVYPNS QLYKDKEEME ALQLGAVQML APSLAKFAPL GVKEFEVFDL PFIFDNYQEL
HKVTQGPVGA KLLKKLERKG ILGLAYWDNG FKVMSANKPL KSVNDFRGQK MRIQSSKVLD
SQMRSVGAMP QVLAFSEVYQ ALQTGVVDGT ENPPSNLYTQ KMHEVQKYVT LSDHGYLGYA
VIVNKKFWQG LPADIRTILE GCMKDATKYA NDIAKKDNEE ALAGVKKSGR SQLISLTPQE
RTAWKKAMDK AHKSNMGRIG ADIIKEVYAA TGYNPN
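
The relationship between the gene sequence, the protein sequence, and the GC content field can be illustrated with a short script. This is a sketch under stated assumptions, not the method used to generate the record: the helper names `clean`, `gc_percent`, and `translate` are hypothetical, and the codon table is the standard one, whose amino-acid assignments translation table 11 shares (table 11 differs in its permitted start codons). Whitespace is stripped first because the sequences are displayed in blocks of ten.

```python
# Standard codon table in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used for display and uppercase the result."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Return the percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 18 bases of the gene sequence above.
print(translate("ATGAGTCTCA AAGCATGT"))  # prints: MSLKAC
```

The six residues printed match the start of the protein sequence listed above. Passing the full gene sequence to `gc_percent` and `translate` would allow the same comparison against the GC content and protein sequence fields.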