Gene GM21_2702 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_2702 
Symbol 
ID8138044 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp3146493 
End bp3148013 
Gene Length1521 bp 
Protein Length506 aa 
Translation table11 
GC content66% 
IMG OID644870306 
ProductNa+/solute symporter 
Protein accessionYP_003022496 
Protein GI253701307 
COG category[R] General function prediction only 
COG ID[COG4147] Predicted symporter 
TIGRFAM ID[TIGR03648] probable sodium:solute symporter, VC_2705 subfamily 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value0.0000000021967 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAATGATT CCTCCTGGTC CATCTTCGTG GTCCTTTTGG TGCTGGCTTC CTTCATCGTG 
GTGGGGCTCC GGCAGAAGTC GCGCGAAAGC CAGGAGTACG GCTTCGGCGG CCGCTACACC
GGGAGGATCG GGGGGGGTGC GGCCATCGCC AGCAACTGGA TGAGCGCCGC GAGCCTCATG
GGGCTTGCCG GGATCATCTA TCTGCAGGGG TACCAGGGGC TCGCCTACGT GATCGGCTGG
ACCGGCGGCT ACGTGCTGCT CCTCGTGCTG CTGGCGAGCC AGATCCGCCG CTTCGGCAAG
TTCACCGCCC CCGAGTTCGT GGGGGAGCGC TACGGCTCCC AGGCCGCGCG CGCCATCGCC
GCCGCCATCT CCATCGCCAT CTCCATCATC TACTGCGTGG CCCAGTTCAA GGGGATAGGT
CTCATCTTCT CCTTCATGTT CGGCATCGAC TATCAGCAGG GGGTCATCTA CGGGGCCCTC
GCCGTGGTCT CCTACCTGGT CGTCTCCGGG ATGCTCGGGC TCCCCAGAAA CCAGCAGCTG
CAATACCTGG TGATCGGCGT CTCCTTCATC GTGCCGCTCA TGTGGCTCGC CCGAAAGCTC
GGCTACTTCT GGCTCCTCCC CCAGTTCGGC TACGGTCGCG CTGTTACCGA CCTGTCGCGC
CAGTTCGACA TCGACTTCAC CCTCCCCTTC GCCAACGGCT CCCTGTTCCA GTGGTGCGCG
CTCTGCTTCA CGCTCATGGT CGGCACCGCG GGGCTTCCCC ACGTGCTGTC GCGCTTTTAC
ACGGTACCCA ACGTGCGCGA CGCGCGCTGG AGCGTGGTCT GGGGGCTCTT CTTCATCGCC
CTCATCTACT GGTCCGCCCC CGCCTTCTCC GTCTTCGGCA GGCTCTTGGA GGCGAGAAGC
GGCGTTATCC CCGACCCCGC GGCGGCCCGC GCCACAGCCG ACGTGATCGC GCTTAAGACC
GCGGTTTGGG CAGGGCTTCC CGGTTGGCTC GTCGGGGTCC TCGCCGCGGG CGCCCTCTCG
GCCGCCTTCT TCACCGTGGC CGGGCTTTTG ATGACCGGCG CCGCCTCGAT CTCCCACGAC
ATCTACTATT CCATGTTCAA CCGCAGCGCC AGCGAGTCGG CGCGGATGCA GGTGGCCAAG
GGGGGGACCC TGGTGCTCGC CGCCATCGTG CTTCTATTGG CGCTCGACCC GCCGGGGCTG
ATCGCGGAGA TCACCGCCGT CGCCTTCGCT CTAGCCGGCA ACACCATCTT CCCGCTCTTT
CTTTTGGGGA TCTGGTGGGG GCGCGCCAAC CGCCACGGGG CCATAGCGGG GATGCTCACC
GGCATCGTCT GTACCGCCAT CGCCCCTCTT TGCGGCGGGA TTTTCCCGCA GCTGGCGCTT
CTCTTCCCGG TAACCTCGTC CGCGCTTCTG GGTGCCCCCC TGGTGATAGC GGTGATGATA
GCGGTTTCGC TTTTGACCCC CGCTCCGCCG GAGGAGATGG GGCGCTTTCT GGAAAAAGAG
GTGCACGGGC ACCTCGACTG A
 
Protein sequence
MNDSSWSIFV VLLVLASFIV VGLRQKSRES QEYGFGGRYT GRIGGGAAIA SNWMSAASLM 
GLAGIIYLQG YQGLAYVIGW TGGYVLLLVL LASQIRRFGK FTAPEFVGER YGSQAARAIA
AAISIAISII YCVAQFKGIG LIFSFMFGID YQQGVIYGAL AVVSYLVVSG MLGLPRNQQL
QYLVIGVSFI VPLMWLARKL GYFWLLPQFG YGRAVTDLSR QFDIDFTLPF ANGSLFQWCA
LCFTLMVGTA GLPHVLSRFY TVPNVRDARW SVVWGLFFIA LIYWSAPAFS VFGRLLEARS
GVIPDPAAAR ATADVIALKT AVWAGLPGWL VGVLAAGALS AAFFTVAGLL MTGAASISHD
IYYSMFNRSA SESARMQVAK GGTLVLAAIV LLLALDPPGL IAEITAVAFA LAGNTIFPLF
LLGIWWGRAN RHGAIAGMLT GIVCTAIAPL CGGIFPQLAL LFPVTSSALL GAPLVIAVMI
AVSLLTPAPP EEMGRFLEKE VHGHLD