Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_1373 |
Symbol | |
ID | 8136701 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 1617999 |
End bp | 1619987 |
Gene Length | 1989 bp |
Protein Length | 662 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644868987 |
Product | SSS sodium solute transporter superfamily |
Protein accession | YP_003021190 |
Protein GI | 253700001 |
COG category | [R] General function prediction only |
COG ID | [COG4147] Predicted symporter |
TIGRFAM ID | [TIGR00813] transporter, SSS family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 50 |
Fosmid unclonability p-value | 0.00174329 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAAAAAA GAATCGCCGC CATAACGCTC GCTCTTACGT TCTCGGTTTG CGCCGCCGCT CCCCTCGCAG TCGCAGCCTC GGCAGGAGCT GCCGCCGCGC CGCAGTCGCC GGCGGCGGTA GCGCCCGCGG CAACGGCTGC GCCGGCAGCC GCCGACGCGG CTGTGCCCAC CGCGGCAAAG GGCGCTTTAG CCAAGGGCGC AGATACCCAG CCGGCCAAGG TGACCCCCAA CCGCGGCATC ACCATCGGCA TGTTCGCGCT CATCATTGCC ATCACCATGG GGGTGGTGGT CTGGGCCGCG AAGAAAACGC AGACCGCCGC CGACTTCTAC ACCGCGGGCG GTGGGATCAC CGGCCTCCAA AACGGCTGGG CCATAGCCGG CGACTACATG TCGGCCGCCT CCTTCCTCGG CATGTCGGGG CTCATCTCGC TCTACGGCAT CGACGGCTTC ATGTACGCGG TCGGCCCCAT GTTCTCCTTC ATCGCCATCC TCTTGGTCAT CGCCGAACCG TGCCGCAACG CCGGCAAGTA CACGTTGGGA GACATCCTCT CCTTCAGGTC GTCGCCCAAG ATCGTGCGGG CCGTAGCTGC CCTTTCCACC GTCACCGTCT CCATCTTCTA CCTGATCGCC CAGATGGTCG GCGCCGGAAA GCTGATGCAG ATGCTGATCG GGATACCGTA CCGCGTCTCC GTCATCGGGG TCGGGGCGCT TATGGTCGCC TACGTCGTCT TCGGCGGCAT GAAGGCCACC ACCTGGGTGC AGATCATAAA GGCTTCGCTG TTGATGGGCG CGACGACGCT CCTCTGCATA CTGGTCGCAG CGAAGGCCGG CTTCAACCCG GTCTCCTTCT TCACTGACAT CGTGAACAGC CCTGCCATCC AGGATCACGT CAGGCTGAAC GTATTGAAGG ACGCGATTCC CAAGGCGGGG ATGGATTACG GCCAGCGCTT CATGGAGCCC GGACTCTTCC TGAAAAGCCC ACTGGACCAG ATCTCGCTGG GTATCGCCTG GGCGCTCGGC GCCGCCGGGC TGCCGCACAT CCTGATGCGC TTTTTCACGG TCCCCAGCGC CAAGGAGGCG CGCAAGTCGA TCATCATCGC CCTTTTCCTG AACTCCAGCT TCTTCTTCAT GATCAGCATC ATCGGTTTCG GAGCCGCGCT TTACCTGACC CCGCAGGGGA TCATGGCAGT GGACAAGGGT GGCAACATGG CGACGCTGCT TCTGGCGCAG CACATGGGTG GCGGCGCCGG CAGCCTCGGC GGAGACGTCT TCCTTGCCTT CATCTGCGCC GTAGCCTTCG CCACCATCCT CGCCGTCGTC TCGGGGCTCG TGCTCGCGGC CTCCGCGGCA ATAGCCCACG ACGTCTACGT GAACATCATC AAGGACGGCA AGGCCGACCA GCACCTGCAG GTGAAGGTGG CCCGCATCAC CTCCCTGTTC GTCGGCACCT CCGCCATCCT GATGGGCCTT GCGGCCGAGA AGGAGAACGT GGTCGCCCTG GTGGCGCTGG CCTTCGCGGT CGCCGCTTCG GGGAACTTCC CGGTGATCAT GCTCTCCTTG TTCTGGAAGA AGTTCAACAC CGCGGGGATC GTCTCCGGCC TCGTGGTCGG GACCGTCACC GCCCTCGCGC TGGTGGTGGT GTCGCCGGTG ATGACCTACC CCAAGAAGGT CGCGGCCGAC GCCAAGAAGA TCGTCGACAC CCTGGAGCTG AAACAGGCAT CCGGCGTGGC GCTGGCGGAC AAGGAGCTGA AGACCCTGGA GAAGTCGCGC GTGGAATATG AGAAGAACAA GAACGGCAGC TCCATGGTGG GGCTCGACAA GCCGATATTC CCGCTGAAGA ACCCGGGGAT CGTCTCGGTG CCGCTTGGAT TCCTCGCCGC CGTTTTCGGC TGCCTCCTGT TCCGCGACCG CCGCGCAGAG GATATGTTCT CCGAGATCGA CGTGCGGCAG AATACCGGCC TCGGGATAGC CAAGGCCACC GATCATTAG
|
Protein sequence | MKKRIAAITL ALTFSVCAAA PLAVAASAGA AAAPQSPAAV APAATAAPAA ADAAVPTAAK GALAKGADTQ PAKVTPNRGI TIGMFALIIA ITMGVVVWAA KKTQTAADFY TAGGGITGLQ NGWAIAGDYM SAASFLGMSG LISLYGIDGF MYAVGPMFSF IAILLVIAEP CRNAGKYTLG DILSFRSSPK IVRAVAALST VTVSIFYLIA QMVGAGKLMQ MLIGIPYRVS VIGVGALMVA YVVFGGMKAT TWVQIIKASL LMGATTLLCI LVAAKAGFNP VSFFTDIVNS PAIQDHVRLN VLKDAIPKAG MDYGQRFMEP GLFLKSPLDQ ISLGIAWALG AAGLPHILMR FFTVPSAKEA RKSIIIALFL NSSFFFMISI IGFGAALYLT PQGIMAVDKG GNMATLLLAQ HMGGGAGSLG GDVFLAFICA VAFATILAVV SGLVLAASAA IAHDVYVNII KDGKADQHLQ VKVARITSLF VGTSAILMGL AAEKENVVAL VALAFAVAAS GNFPVIMLSL FWKKFNTAGI VSGLVVGTVT ALALVVVSPV MTYPKKVAAD AKKIVDTLEL KQASGVALAD KELKTLEKSR VEYEKNKNGS SMVGLDKPIF PLKNPGIVSV PLGFLAAVFG CLLFRDRRAE DMFSEIDVRQ NTGLGIAKAT DH
|
| |