Gene GM21_2699 details


Gene Information

Locus tag: GM21_2699
Symbol:
ID: 8138041
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 3142708
End bp: 3144705
Gene Length: 1998 bp
Protein Length: 665 aa
Translation table: 11
GC content: 65%
IMG OID: 644870303
Product: SSS sodium solute transporter superfamily
Protein accession: YP_003022493
Protein GI: 253701304
COG category: [R] General function prediction only
COG ID: [COG4147] Predicted symporter
TIGRFAM ID: [TIGR00813] transporter, SSS family
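
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below is not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative.

```python
# Values copied from the Gene Information record.
start_bp = 3142708
end_bp = 3144705
gene_length_bp = 1998
protein_length_aa = 665

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and does not contribute a residue.
codons, remainder = divmod(gene_length_bp, 3)

coords_match = span == gene_length_bp                # coordinates agree with the stated length
in_frame = remainder == 0                            # length is a whole number of codons
protein_match = codons - 1 == protein_length_aa      # coding codons agree with protein length

print(coords_match, in_frame, protein_match)
```

Under these assumptions the span is 1998 bp, which gives 666 codons: 665 coding codons plus one stop codon.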


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.0000000000000107835
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGACCATGA AGAAGATAAT CATAGCAGCC ACGCTGGCCC TCTCCGTTGC CGGTGCCGCC 
TTCGCCGAAG AGCAAAAGGC AGCCCCCGCC GCAGCCCCCG CGGTGAGCGC ACCCGCGGCA
CAGGGCGCAC CCGCCGCCGC AGTCGCCGCC CCCGCCGCCC AGGTGAACCA GGCCCAGGCG
GTCGCCACCC CGGCGCCGGC AGCTCCCGCG CCGGCCAAAA GGGAACTGAA GACCAACAAG
GTCATCACCA TCGGCATGTT CGCCGTCATC ATCGCCATCA CCATGGGGGT CGTCGTCTGG
GCCGCCAAGC AGACCAAGAC CGCTTCCGAC TTCTACGCCG CGGGCGGCGG CATCACGGGG
ACCCAGAACG GCTGGGCCAT CGCCGGCGAC TACATGTCGG CGGCCTCCTT CCTGGGGATA
TCCGGCCTGA TCTCGCTCTA CGGCTACGAC GGGTTCATGT ACTCGGTGGG CTGGCTGGTC
GCCTACATCA CGGTGCTCCT GATCGTCGCC GAGCCGTGCC GCAACGCGGG CAAGTACACC
CTGGGGGACA TCCTCTCCTT CCGTACCTCG CCGAAGCCGG TGCGCGCCTT CGCCGCCATC
TCCACCGTCG CCGTATCCAC CTTCTACCTC ACCGCGCAGA TGGTCGGTGC AGGCAAACTG
ATGGCGCTTC TCATCGGCAT CCCCTACAAG ATGTCCATCA TCGGCGTCGG CATCCTCATG
GTAGGCTACG TCGTCTTCGG CGGCATGGTC GCCACCACCT GGGTGCAGAT CATCAAGGCG
GGCCTCCTCA TGTCGGGCGC CTTCCTGCTC TCCTTCCTGG TCATGCTGAA GGCGGGCTTC
AACCCGATCG GTTTCTTCTC CACCATCGTC AGCAGCCCCG ATATCCAGGA CCACGTCTCG
AAGTTGGTAC TGAAGGACGG CGTCATCCTT GCCGGCGCAG ACGCAGGTCA GCGCTTCCTT
GAGCCCGGCC TCTACCTGAA GAACCCGCTG GACCAGATCT CGCTCGGCAT GGCCCTCGTG
CTCGGCACCG CCGGCATGCC GCACATCCTG ATGCGCTTTT TCACCGTCCC GACCGCACAG
GCGGCGCGTA AATCGGTCAT CATCGCGATG TTCATCATCG GCGGCTTCTA CGTCCTGACC
ACCCTGCTCG GCTTCGGCGC AGCGATCCAC CTCACCCCGC AGGGGATCAC CCAGGTCGAC
CCGGGCGGCA ACATGGCTAC CCTGATGCTG TCCCAGCAGA TGGGCGCCGA CATAGCACCT
GTGGTCGGTG ACCTCTTCCT CGCCTTCCTT TGCGCAGTCG CCTTCGCCAC CATCCTCGCC
GTCGTCTCCG GCCTGGTACT GGCTGCATCC GCGGCCATCG CACACGACAT CTACGTGAAC
GTGATCAAGG ACGGCCACGC CGACCAGCAC GAGCAGGTCT TCGCAGCCCG CGCCACCTCC
TTCGTGGTCG GCGCCTGCGG CATCATGATC GGCCTCGCGG CCGAGAAGCA GAACGTCGCC
CACCTGGTGG CGCTCGCCTT CGCGGTCGCC GCCTCCGGGA ACCTTCCGGT CGTGGTACTC
TCGCTCTTCT GGAGGAAGTT CAACACCGCC GGCGTCATCT CCGGCCTGCT GGTAGGCACC
ATCGCCTCCA TCGGTCTGGT GATGGTCTCC CCCAACATGA CCTACCCGAC CGTGGTGGCC
GCTGGCGCCA AGAAGGTGGT CGTTGCCATG GAGAAGAAGC AGGCCGCTCT GCCTGCCGGC
GAGACCTTGA ACGAGAAGGA CGCCAAGGCA CTCGCCAAGG CGAAGGTCGA CGCGCAGTTG
ACCGGTACCT CCATGATGGG CCTGGAGAAG CCGCTCTTCA CCCTGAAGAA CCCGGGCATC
ATCTCCATCC CGCTGGGCTT CATGGCCGCC ATCCTGGGTT GCCTCGCCTT CCCGAACAGG
CGTTCCGAGG AGATGTTCGA CGAGGTCTAC GTCCGCCAGA ACACCGGTTT GGGTATGGCC
AAGGCGGTCG AACACTAG
 
Protein sequence
MTMKKIIIAA TLALSVAGAA FAEEQKAAPA AAPAVSAPAA QGAPAAAVAA PAAQVNQAQA 
VATPAPAAPA PAKRELKTNK VITIGMFAVI IAITMGVVVW AAKQTKTASD FYAAGGGITG
TQNGWAIAGD YMSAASFLGI SGLISLYGYD GFMYSVGWLV AYITVLLIVA EPCRNAGKYT
LGDILSFRTS PKPVRAFAAI STVAVSTFYL TAQMVGAGKL MALLIGIPYK MSIIGVGILM
VGYVVFGGMV ATTWVQIIKA GLLMSGAFLL SFLVMLKAGF NPIGFFSTIV SSPDIQDHVS
KLVLKDGVIL AGADAGQRFL EPGLYLKNPL DQISLGMALV LGTAGMPHIL MRFFTVPTAQ
AARKSVIIAM FIIGGFYVLT TLLGFGAAIH LTPQGITQVD PGGNMATLML SQQMGADIAP
VVGDLFLAFL CAVAFATILA VVSGLVLAAS AAIAHDIYVN VIKDGHADQH EQVFAARATS
FVVGACGIMI GLAAEKQNVA HLVALAFAVA ASGNLPVVVL SLFWRKFNTA GVISGLLVGT
IASIGLVMVS PNMTYPTVVA AGAKKVVVAM EKKQAALPAG ETLNEKDAKA LAKAKVDAQL
TGTSMMGLEK PLFTLKNPGI ISIPLGFMAA ILGCLAFPNR RSEEMFDEVY VRQNTGLGMA
KAVEH
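
Both sequences are displayed in groups of 10 characters, so the spaces and line breaks have to be removed before any analysis. The following is a minimal sketch of that cleanup and of an in-frame translation using the codon assignments of translation table 11 (which match the standard code). It is an illustration only: the helper names `clean` and `translate` are hypothetical, and only the first and last rows of the gene sequence are used as input. Each row holds 60 bases, a multiple of 3, so every row starts in frame.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (NCBI translation table 11;
# its codon assignments are the same as the standard code).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display grouping."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last rows of the gene sequence, copied as displayed.
first_row = clean("ATGACCATGA AGAAGATAAT CATAGCAGCC ACGCTGGCCC TCTCCGTTGC CGGTGCCGCC")
last_row = clean("AAGGCGGTCG AACACTAG")

print(translate(first_row))  # MTMKKIIIAATLALSVAGAA
print(translate(last_row))   # KAVEH
```

The two outputs match the first 20 and last 5 residues of the protein sequence listed above, and the final codon TAG is a stop codon.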