Gene GM21_1039 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_1039 
Symbol 
ID8136361 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp1222591 
End bp1223910 
Gene Length1320 bp 
Protein Length439 aa 
Translation table11 
GC content61% 
IMG OID644868650 
Productsodium:dicarboxylate symporter 
Protein accessionYP_003020858 
Protein GI253699669 
COG category[C] Energy production and conversion 
COG ID[COG1301] Na+/H+-dicarboxylate symporters 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones74 
Fosmid unclonability p-value0.681651 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAACAA AGAAGTTTTA CCAGCATCTG TACTTCCAGG TATTGACGGC GATCTCGTTC 
GGCGTAGCGC TCGGTTACTA CCTGCCGGAC ACCGGTACCG CGATGAAGCC CTTGGGTGAC
GGGTTCATCA AGATGATCAA GATGATCATC ACCCCCATCA TCTTCTGCAC CGTGGTCACC
GGCATCGCCG GCATGGACGA CATGAAAAAG GTCGGCCGCG TCGGCGGCAA GGCGCTCCTC
TACTTCGAGG CGGTCTCCAC ACTGGCGCTT GGCATCGGGC TCGTGGTGAT CAACATTATC
CAGCCGGGGG TCGGCATGAA CGCCGATGTC ACGAAGCTTG ACACGAAAGG GCTCGACACC
TACACCGCTA CCGCGGCCAA GGGGCACAGC TTCGTGGACT TCGTGCTGGG CGTCATCCCC
AACAGCGTGG TCGACGCCTT CGCCAAGGGC GAAATCCTTC AGGTGCTCTT CTTCGCCATC
CTGTTCGGCC TGGCGCTCTC CGCCATGGGC GAGAAGGGTA AGCCGGTGTA CCGCCTGATC
GACGAGGTGG CGCACGCCTT CTTCGGCGTG GTGAACATCA TCATGAAGTT CGCTCCCATC
GGCGCCTTCG GCGCCATGGC CTTCACCATC GGCAAGTTCG GGCTGGGCTC TTTGACCAAG
CTCGGGATGC TGATGGGAAG CTTCTACCTG ACCTGCCTGC TCTTCGTCTT CGTGGTGCTC
GGCACAATCG GCAAGCTGTG CGGCTTCAAC ATCTTCAAGT TCATCTCCTA CATCAAGGAA
GAGCTCCTCA TCGTGCTGGG GACCTCCTCT TCCGAATCCG CGCTTCCCCG CATGATGGCG
AAGCTGGAGA ACCTCGGCTG CTCCAAGTCC GTGGTCGGAC TGGTGATCCC CACCGGCTAT
TCCTTCAACC TGGACGGCAC CTCCATCTAC CTCACCATGG CCGCGGTGTT CGTGGCGCAG
GCGACCAACA CCCCGCTGGA CCTGACCCAG ACGCTGACCA TCCTGGGCGT GCTGATGCTC
ACCTCGAAAG GGGCCGCCGG CGTCACCGGC AGCGGTTTTG TTACGCTGGC CGCTACCTTT
GCCGCCATCC CCACCATCCC GGTCGCGGGG CTCGCCCTCA TCCTCGGTAT CGACCGCTTC
ATGTCCGAGG CGCGTGCCCT CACCAACCTG GTTGGTAACG GCGTCGCTAC CGTGGTCGTC
TCCCGCTGGG AGAACGAGCT GGACGTGGCC CGCATGTCGC AGGTGCTGAA CAAGGAATTG
GATGAGGCTG ACGGGGAGGC TGATCTGCTG ATGATGGACC CCGAGCCCGA AGAGGCTTAG
 
Protein sequence
MKTKKFYQHL YFQVLTAISF GVALGYYLPD TGTAMKPLGD GFIKMIKMII TPIIFCTVVT 
GIAGMDDMKK VGRVGGKALL YFEAVSTLAL GIGLVVINII QPGVGMNADV TKLDTKGLDT
YTATAAKGHS FVDFVLGVIP NSVVDAFAKG EILQVLFFAI LFGLALSAMG EKGKPVYRLI
DEVAHAFFGV VNIIMKFAPI GAFGAMAFTI GKFGLGSLTK LGMLMGSFYL TCLLFVFVVL
GTIGKLCGFN IFKFISYIKE ELLIVLGTSS SESALPRMMA KLENLGCSKS VVGLVIPTGY
SFNLDGTSIY LTMAAVFVAQ ATNTPLDLTQ TLTILGVLML TSKGAAGVTG SGFVTLAATF
AAIPTIPVAG LALILGIDRF MSEARALTNL VGNGVATVVV SRWENELDVA RMSQVLNKEL
DEADGEADLL MMDPEPEEA