Gene GM21_2856 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_2856 
Symbol 
ID8138199 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp3323764 
End bp3325737 
Gene Length1974 bp 
Protein Length657 aa 
Translation table11 
GC content62% 
IMG OID644870457 
ProductSSS sodium solute transporter superfamily 
Protein accessionYP_003022646 
Protein GI253701457 
COG category[R] General function prediction only 
COG ID[COG4147] Predicted symporter 
TIGRFAM ID[TIGR00813] transporter, SSS family 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones152 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGACATA TGTTCCTTCC CTTGCTGATG GTCCTGATCT TCGGTACCTC CGTGCTTGCA 
GCCGATGCAC CCGCCAAGAC CTCCCCCGGC GCCCCTGATA CCGTGGCGGC GTCCGTTGCC
CAAGCCCCGC CGGGGCTCGC CGCAGCCCCT CAATCGCCGG CCGCCGCTGC GACCAAGGCC
CCGGCACCGG CGCCGGGGGG CAAGAAGATG CAGACCAACC GCACTTTCAC CATCAGCATG
TTCCTCTTGA TCATCGCGGC GACCGCCGGG ATCGTGGTCT GGGCCTCGAA AAGCACCACC
ACCGCATCCG ATTATTACGC GGCGGGCGGG GGCATTTCCG GGATACAGAA CGGCTGGGCC
ATCGCCGGAG ATTTCCTCTC CGCAGCGACC TTCCTCGGGA TCACCGGACT CATGTCCCTC
TTCGGCCCGG ACGGGTTCAT GTACTCGGTG GGGATCATCA TCAGCTTCCT GACCATCCTC
CTCATCATCG CCGAGCCATG CCGCAACGCC GGCAAGTACA CCCTGGGCGA CATCCTCGCC
TTCCGTTCGT CTTCCCGGGT AGTGCGGGCG GTCGCGGCCC TGTCCGCCGT GGTGGTCTCC
ATCTTCTACC TGCTCGGGCA GATGGTGGGG GCGGGAAAAC TGATGCAGCT CCTTTTGGGG
ATTCCCTACA AGACGTCGAT CATAGGAGTC GGCGCCCTCA TCATCGTCTA CGTGGCGCTC
GGCGGGATGA AAGCGACTAC GTGGGTGCAG ATCATCAAGG CGGGACTTCT CATGTTTACC
GGGGTTGTCT TGAGCGTCGG CATACTCTGG AAATCCGGGT TCAGCGTTTT CGCCTTCTTC
GACAGCGTGG CGACCAGCCC GCAGATCCAG GATCACGTGC GCGGCGTCAT GAAACACCCG
GTCGCGCAGC CGGGATTCGA CTACGGCCAG CGCTTCCTGG AACTGGGGCT TTTCTTCAAG
AACCCGCTGG ACCAGATATC GCTCGGGCTG GCGTGGATCC TCGGGGCCGC CGGCCTGCCG
CACGTCATGA TGCGGTTTTT CACTGTTCCC AACGCCAAGG AGGCGAGAAA GAGCGTGGTC
GCCTCCATGT TCCTGATCGG CCTCTTTCTG ATCATGGTTT CCTTCCTGGG CTTTGGCGCC
GCCCTTTACG TCACGCCGCA GAAGATCATG GCGCTCGACA AGGGTGGGAA CATGGCGGGG
CTCATGATCG CGCAGTATAT CGGCGGCGGG GCGGGTACCG TCACCGGAGA CCTGCTGCTG
GCCTTCGTCT GCGCCGTCGC TTTCGCCACC ATACTCGCGG TCGTCTCGGG TCTGGTGCTG
GCATCCTCGG CCGCCATCGC CCACGATCTA TACGTGAACG TGGTGAAAAA AGGGAAGGCG
GATCAGGGGA CACAGATAAA GGTCGCGAGG ATCGCCTCCT TCTTCGTCGG CGCCATCGCC
ATCGTGCTCG GCATCGCCTG TGAGAACCTC AACATCGCGC AACTGGTCGG CCTGGCGCTT
GCCGTGGTGG CTTCGGCCAA CTTCCCGGTC CTTATCTTCG CGCTGTTCTG GAAGCGGTTC
AATTCAGCCG GCATCATCGC CGGGCTGGTG GTCGGCACCG TGGTGACCAT CGGGATCCTG
ATGGTTTCGC CCAACATGAC CTATCCCAAG AAGGTGGCGG CCGACGCCCA GAAGGTCGTG
CTAGCTTTGG AGAAAAAGCA AAGCGAGGCG GGGGGGCTTA CTGATGCGGA ATTGAAGACG
CTGGAAAAGG CGAAGTCGGA TTACGTCCTG AACAAGGACG GGACCTCGCT GGTGGGGCTC
GACGCGCCGC TTTTCCCTCT CAAGAACCCC GGCATACTCT CCGTGCCCAT CGGATTCCTG
GTCACCGTCG CCGCCACGCT ATTATTTCGC AACCGCCGCG AGGAAGAGAT GTTCGAAGAA
CTGTTTGTCC GGCAGACCAC GGGATACGGC ATGGCCAAGG CGGCCAAGCA CTGA
 
Protein sequence
MRHMFLPLLM VLIFGTSVLA ADAPAKTSPG APDTVAASVA QAPPGLAAAP QSPAAAATKA 
PAPAPGGKKM QTNRTFTISM FLLIIAATAG IVVWASKSTT TASDYYAAGG GISGIQNGWA
IAGDFLSAAT FLGITGLMSL FGPDGFMYSV GIIISFLTIL LIIAEPCRNA GKYTLGDILA
FRSSSRVVRA VAALSAVVVS IFYLLGQMVG AGKLMQLLLG IPYKTSIIGV GALIIVYVAL
GGMKATTWVQ IIKAGLLMFT GVVLSVGILW KSGFSVFAFF DSVATSPQIQ DHVRGVMKHP
VAQPGFDYGQ RFLELGLFFK NPLDQISLGL AWILGAAGLP HVMMRFFTVP NAKEARKSVV
ASMFLIGLFL IMVSFLGFGA ALYVTPQKIM ALDKGGNMAG LMIAQYIGGG AGTVTGDLLL
AFVCAVAFAT ILAVVSGLVL ASSAAIAHDL YVNVVKKGKA DQGTQIKVAR IASFFVGAIA
IVLGIACENL NIAQLVGLAL AVVASANFPV LIFALFWKRF NSAGIIAGLV VGTVVTIGIL
MVSPNMTYPK KVAADAQKVV LALEKKQSEA GGLTDAELKT LEKAKSDYVL NKDGTSLVGL
DAPLFPLKNP GILSVPIGFL VTVAATLLFR NRREEEMFEE LFVRQTTGYG MAKAAKH