Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_3386 |
Symbol | |
ID | 7976167 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | - |
Start bp | 3412664 |
End bp | 3414118 |
Gene Length | 1455 bp |
Protein Length | 484 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 644800151 |
Product | sodium/proline symporter |
Protein accession | YP_002951290 |
Protein GI | 239828666 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG0591] Na+/proline symporter |
TIGRFAM ID | [TIGR00813] transporter, SSS family [TIGR02121] sodium/proline symporter |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGGTATTGA TTTCTGTCGC AGTGTATATG ATTGGGATGC TTCTTATTGG GTATTGGGCT TATAAACGTA CGTCCAACCT TTCCGATTAT ATGCTTGGGG GAAGGACGTT AGGCCCTGCG GTCACAGCGC TCAGTGCGGG GGCTTCCGAT ATGAGCGGTT GGCTGTTGAT GGGGCTTCCA GGAGCAATGT ATGCGCAAGG ATTAAGTGCA TCATGGATTG TTATTGGGCT TACGCTCGGA GCATATGCGA ATTGGTTATA TGTTGCGCCT CGTTTGCGTG TATATACGGA AGTAGCAAAT GATTCCATTA CCATCCCGGA ATTTTTAGAA AATAGATTTG GCGATACATC GAAGCTGCTT CGGTTAATTT CCGGTCTTGT TATTATGATT TTCTTTACCT TTTATGTATC TTCTGGTCTT GTATCTGGAG CGGTGCTGTT TCAAAACTCA TTTGGCGCAA GTTATCATAC AGGATTATGG ATTGTTGCCG GCGTCGTTGT GGCGTATACA TTGTTTGGAG GATTTTTGGC TGTTAGTTGG ACCGATTTTG TGCAAGGAAC GATTATGTTT ATTGCTCTTA TTCTTGTCCC GGCCGTAACG CTTTTCCATA CGGGCGGTGT CGGCGATACG TTTACTACCA TTAAAAACAT TGATCCTAAT TTGCTCGATT TATGGAAAGG AACTAGCTTC CTCGGTATTA TTTCGCTGTT TGCGTGGGGG CTTGGCTATT TCGGGCAGCC GCACATTATT GTCCGCTTTA TGGCGATTTC GTCGGTCAAG GAAATGAAAA GCGCCCGCCG CATCGGAATG GGGTGGATGA TTTTCTCCGT TGTCGGCGCG ATGTTGACAG GGCTGTTTGG AATCGCTTAC TTTTCACAGC ACGGCACTAA GCTCGATGAT CCGGAGACCG TATTTATCAA GCTTGGAGAA ATTTTATTCC ATCCGCTCAT CACCGGATTT TTGCTTGCGG CGATTTTAGC GGCCATTATG AGTACGATTT CTTCGCAGCT TCTTGTTACG TCCAGTTCCT TAACAGAGGA TTTATATAAA GTGGTATTCC GTCGTTCCGC TTCGGATAAA GAGTTGATTT TCGTCGGCCG TCTTTCCGTA TTAATTGTAG CGTTAGTAGC GTCCGCGTTC GCGTACACGA AAAACGATAC GATTTTAAAC TTGGTCGGTT ATGCGTGGGC AGGATTCGGT GCTTCGTTTG GTCCAGTCAT TTTATTAAGC CTGTTCTGGC GCCGCATGAC GAAATGGGGG GCGTTTGCCG GCATGGTCGC AGGAGCGATG ACCGTGATTC TTTGGACACA ATCGGAATAT TTGAAAAACT TGCTGTATGA GATGATTCCA GGTTTTGCAG CAAGCTTGGC CGCGATTGTT GTCGTTAGCT TGTTGACAAA AGCGCCGGAA GAAAAAGTTG TCGAGCAGTT TGACAAATTT AAAGCATCGT TATAA
|
Protein sequence | MVLISVAVYM IGMLLIGYWA YKRTSNLSDY MLGGRTLGPA VTALSAGASD MSGWLLMGLP GAMYAQGLSA SWIVIGLTLG AYANWLYVAP RLRVYTEVAN DSITIPEFLE NRFGDTSKLL RLISGLVIMI FFTFYVSSGL VSGAVLFQNS FGASYHTGLW IVAGVVVAYT LFGGFLAVSW TDFVQGTIMF IALILVPAVT LFHTGGVGDT FTTIKNIDPN LLDLWKGTSF LGIISLFAWG LGYFGQPHII VRFMAISSVK EMKSARRIGM GWMIFSVVGA MLTGLFGIAY FSQHGTKLDD PETVFIKLGE ILFHPLITGF LLAAILAAIM STISSQLLVT SSSLTEDLYK VVFRRSASDK ELIFVGRLSV LIVALVASAF AYTKNDTILN LVGYAWAGFG ASFGPVILLS LFWRRMTKWG AFAGMVAGAM TVILWTQSEY LKNLLYEMIP GFAASLAAIV VVSLLTKAPE EKVVEQFDKF KASL
|
| |