Gene GM21_1373 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_1373 
Symbol 
ID8136701 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp1617999 
End bp1619987 
Gene Length1989 bp 
Protein Length662 aa 
Translation table11 
GC content64% 
IMG OID644868987 
ProductSSS sodium solute transporter superfamily 
Protein accessionYP_003021190 
Protein GI253700001 
COG category[R] General function prediction only 
COG ID[COG4147] Predicted symporter 
TIGRFAM ID[TIGR00813] transporter, SSS family 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones50 
Fosmid unclonability p-value0.00174329 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAAAAAA GAATCGCCGC CATAACGCTC GCTCTTACGT TCTCGGTTTG CGCCGCCGCT 
CCCCTCGCAG TCGCAGCCTC GGCAGGAGCT GCCGCCGCGC CGCAGTCGCC GGCGGCGGTA
GCGCCCGCGG CAACGGCTGC GCCGGCAGCC GCCGACGCGG CTGTGCCCAC CGCGGCAAAG
GGCGCTTTAG CCAAGGGCGC AGATACCCAG CCGGCCAAGG TGACCCCCAA CCGCGGCATC
ACCATCGGCA TGTTCGCGCT CATCATTGCC ATCACCATGG GGGTGGTGGT CTGGGCCGCG
AAGAAAACGC AGACCGCCGC CGACTTCTAC ACCGCGGGCG GTGGGATCAC CGGCCTCCAA
AACGGCTGGG CCATAGCCGG CGACTACATG TCGGCCGCCT CCTTCCTCGG CATGTCGGGG
CTCATCTCGC TCTACGGCAT CGACGGCTTC ATGTACGCGG TCGGCCCCAT GTTCTCCTTC
ATCGCCATCC TCTTGGTCAT CGCCGAACCG TGCCGCAACG CCGGCAAGTA CACGTTGGGA
GACATCCTCT CCTTCAGGTC GTCGCCCAAG ATCGTGCGGG CCGTAGCTGC CCTTTCCACC
GTCACCGTCT CCATCTTCTA CCTGATCGCC CAGATGGTCG GCGCCGGAAA GCTGATGCAG
ATGCTGATCG GGATACCGTA CCGCGTCTCC GTCATCGGGG TCGGGGCGCT TATGGTCGCC
TACGTCGTCT TCGGCGGCAT GAAGGCCACC ACCTGGGTGC AGATCATAAA GGCTTCGCTG
TTGATGGGCG CGACGACGCT CCTCTGCATA CTGGTCGCAG CGAAGGCCGG CTTCAACCCG
GTCTCCTTCT TCACTGACAT CGTGAACAGC CCTGCCATCC AGGATCACGT CAGGCTGAAC
GTATTGAAGG ACGCGATTCC CAAGGCGGGG ATGGATTACG GCCAGCGCTT CATGGAGCCC
GGACTCTTCC TGAAAAGCCC ACTGGACCAG ATCTCGCTGG GTATCGCCTG GGCGCTCGGC
GCCGCCGGGC TGCCGCACAT CCTGATGCGC TTTTTCACGG TCCCCAGCGC CAAGGAGGCG
CGCAAGTCGA TCATCATCGC CCTTTTCCTG AACTCCAGCT TCTTCTTCAT GATCAGCATC
ATCGGTTTCG GAGCCGCGCT TTACCTGACC CCGCAGGGGA TCATGGCAGT GGACAAGGGT
GGCAACATGG CGACGCTGCT TCTGGCGCAG CACATGGGTG GCGGCGCCGG CAGCCTCGGC
GGAGACGTCT TCCTTGCCTT CATCTGCGCC GTAGCCTTCG CCACCATCCT CGCCGTCGTC
TCGGGGCTCG TGCTCGCGGC CTCCGCGGCA ATAGCCCACG ACGTCTACGT GAACATCATC
AAGGACGGCA AGGCCGACCA GCACCTGCAG GTGAAGGTGG CCCGCATCAC CTCCCTGTTC
GTCGGCACCT CCGCCATCCT GATGGGCCTT GCGGCCGAGA AGGAGAACGT GGTCGCCCTG
GTGGCGCTGG CCTTCGCGGT CGCCGCTTCG GGGAACTTCC CGGTGATCAT GCTCTCCTTG
TTCTGGAAGA AGTTCAACAC CGCGGGGATC GTCTCCGGCC TCGTGGTCGG GACCGTCACC
GCCCTCGCGC TGGTGGTGGT GTCGCCGGTG ATGACCTACC CCAAGAAGGT CGCGGCCGAC
GCCAAGAAGA TCGTCGACAC CCTGGAGCTG AAACAGGCAT CCGGCGTGGC GCTGGCGGAC
AAGGAGCTGA AGACCCTGGA GAAGTCGCGC GTGGAATATG AGAAGAACAA GAACGGCAGC
TCCATGGTGG GGCTCGACAA GCCGATATTC CCGCTGAAGA ACCCGGGGAT CGTCTCGGTG
CCGCTTGGAT TCCTCGCCGC CGTTTTCGGC TGCCTCCTGT TCCGCGACCG CCGCGCAGAG
GATATGTTCT CCGAGATCGA CGTGCGGCAG AATACCGGCC TCGGGATAGC CAAGGCCACC
GATCATTAG
 
Protein sequence
MKKRIAAITL ALTFSVCAAA PLAVAASAGA AAAPQSPAAV APAATAAPAA ADAAVPTAAK 
GALAKGADTQ PAKVTPNRGI TIGMFALIIA ITMGVVVWAA KKTQTAADFY TAGGGITGLQ
NGWAIAGDYM SAASFLGMSG LISLYGIDGF MYAVGPMFSF IAILLVIAEP CRNAGKYTLG
DILSFRSSPK IVRAVAALST VTVSIFYLIA QMVGAGKLMQ MLIGIPYRVS VIGVGALMVA
YVVFGGMKAT TWVQIIKASL LMGATTLLCI LVAAKAGFNP VSFFTDIVNS PAIQDHVRLN
VLKDAIPKAG MDYGQRFMEP GLFLKSPLDQ ISLGIAWALG AAGLPHILMR FFTVPSAKEA
RKSIIIALFL NSSFFFMISI IGFGAALYLT PQGIMAVDKG GNMATLLLAQ HMGGGAGSLG
GDVFLAFICA VAFATILAVV SGLVLAASAA IAHDVYVNII KDGKADQHLQ VKVARITSLF
VGTSAILMGL AAEKENVVAL VALAFAVAAS GNFPVIMLSL FWKKFNTAGI VSGLVVGTVT
ALALVVVSPV MTYPKKVAAD AKKIVDTLEL KQASGVALAD KELKTLEKSR VEYEKNKNGS
SMVGLDKPIF PLKNPGIVSV PLGFLAAVFG CLLFRDRRAE DMFSEIDVRQ NTGLGIAKAT
DH