Gene Information |
Locus tag | BCG9842_B4055 |
Symbol | |
ID | 7181918 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus cereus G9842 |
Kingdom | Bacteria |
Replicon accession | NC_011772 |
Strand | - |
Start bp | 1191013 |
End bp | 1192491 |
Gene Length | 1479 bp |
Protein Length | 492 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 643549010 |
Product | sodium/proline symporter family protein |
Protein accession | YP_002444681 |
Protein GI | 218896270 |
COG category | [E] Amino acid transport and metabolism; [R] General function prediction only |
COG ID | [COG0591] Na+/proline symporter |
TIGRFAM ID | [TIGR00813] transporter, SSS family; [TIGR02121] sodium/proline symporter |
|
|
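The length fields in the table above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the unspliced CDS ends in a single stop codon. The function names are illustrative and do not belong to any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminating stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(1191013, 1192491)
print(length, protein_length_aa(length))  # prints: 1479 492
```

Both results agree with the Gene Length (1479 bp) and Protein Length (492 aa) rows.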
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0000025968 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 90 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTACGC AGATGTTAAT TTTAACTTCT ATCTCTATTT ACATGCTCGG GATGCTAATT ATCGGCTATT TCGCTTATAA GCAAACATCC AACTTAACAG ATTATATGCT TGGCGGGCGT ACACTAGGCC CCGCAGTAAC AGCATTAAGT GCTGGAGCAG CTGATATGAG TGGTTGGCTT TTAATGGGCT TACCCGGTGC AATGTTTAGC GTTGGATTAA GTAGTAGCTG GATTGCGATT GGCCTAACAT TAGGCGCATA TGCAAACTGG CTTTATGTCG CTCCTCGCTT ACGTACCTAC TCTGAAATTG CAAATAACTC TATTACTATC CCAGAATTTT TGGAACACCG TTTCCACGAC AAATCCCATA TGCTACGTTT AGTATCTGGA CTTGTTATTA TGATATTTTT CACGTTTTAT GTAGCTTCAG GATTTGTCTC TGGTGCTGTA TTATTCGAAA ATTCATTTGG ACTCAATTAC CATGTTGGTC TTCTTATCGT TGGTGGAGTT GTCGTAGCTT ACACATTATT TGGTGGATTT TTAGCTGTAA GTTGGACAGA CTTCGTGCAA GGAATCATTA TGGTAGTCGC TCTTATTCTT GTTCCAGTCG TAACAATTAT GCACGTAAAT GGACTTGGTC CAGCATTTGA AACAATTAAA TCTATCGATC CGGCATTATT AGATATTTTT AAAGGTACTT CTGTATTAGG AATTATTTCA TTATTCGCAT GGGGCCTTGG TTATGTTGGA CAACCACATA TTATTGTACG CTTTATGGCG ATTTCTTCTG TAAAAGAAAT TAAAAGTGCA CGACGTATTG GTATGAGCTG GATGATTTTC TCTGTTGCTG GAGCTATGTT TACTGGCCTT ATCGGTATTG CATACTATTC AAAAGCAGGT TTAAAACTTT CTGATCCAGA AACGATTTTC GTTGAACTTG GCACTATTTT ATTCCATCCA CTTATTACTG GATTTTTATT AGCAGCTATT TTAGCAGCTA TTATGAGTAC AATTTCTTCT CAACTTCTCG TTACTTCGAG TGCAGTAACA GAAGACTTAT ATAGAACATT CTTTAAGCGT GATGCTTCTG ATAAAGAACT TGTATTTGTC GGTCGTATGG CTGTTCTTGT TATTGCTTTA ATTGGATGTG CATTAGCACT TAAACAAAAT GATACGATTT TAGCTCTTGT TGGATATGCT TGGGCTGGGT TCGGTTCTTC ATTCGGACCT GCTATTTTAT TAAGCTTATA TTGGAAACGT ATGACGAAAT GGGGCGCGCT TGCTGGTATG GTTTCTGGTG CCGCTACTGT TATTATATGG ACTCAATTCA AATTCTTAAA AGATTTCTTA TATGAAATGA TTCCAGGTTT CGCTATTAGT TTACTAGCTA TCGTAATTGT TAGTTTACTA ACACAACCTT CAAAAGAAGT TGAAGAGCAA TTTGAGAATT TCGAAAAACA ACATAGTCAT AATCTATAA
|
Protein sequence | MSTQMLILTS ISIYMLGMLI IGYFAYKQTS NLTDYMLGGR TLGPAVTALS AGAADMSGWL LMGLPGAMFS VGLSSSWIAI GLTLGAYANW LYVAPRLRTY SEIANNSITI PEFLEHRFHD KSHMLRLVSG LVIMIFFTFY VASGFVSGAV LFENSFGLNY HVGLLIVGGV VVAYTLFGGF LAVSWTDFVQ GIIMVVALIL VPVVTIMHVN GLGPAFETIK SIDPALLDIF KGTSVLGIIS LFAWGLGYVG QPHIIVRFMA ISSVKEIKSA RRIGMSWMIF SVAGAMFTGL IGIAYYSKAG LKLSDPETIF VELGTILFHP LITGFLLAAI LAAIMSTISS QLLVTSSAVT EDLYRTFFKR DASDKELVFV GRMAVLVIAL IGCALALKQN DTILALVGYA WAGFGSSFGP AILLSLYWKR MTKWGALAGM VSGAATVIIW TQFKFLKDFL YEMIPGFAIS LLAIVIVSLL TQPSKEVEEQ FENFEKQHSH NL
|
| |
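The sequence rows above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how such a string could be cleaned, translated, and checked for GC content using only the Python standard library. It assumes the gene sequence is already shown in coding orientation (it begins with ATG even though the strand is "-"). It uses the standard codon assignments, which translation table 11 shares for amino acids. The helper names `clean`, `gc_fraction`, and `translate` are illustrative.

```python
from itertools import product

# Standard codon assignments in TCAG order; translation table 11 uses the
# same amino-acid assignments and differs only in its permitted start codons.
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(_BASES, repeat=3), _AMINO)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop."""
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence row.
head = clean("ATGAGTACGC AGATGTTAAT TTTAACTTCT")
print(translate(head))  # prints: MSTQMLILTS
```

The translated prefix matches the first block of the protein sequence row. Running `gc_fraction` on the full cleaned gene sequence would allow a comparison with the GC content row.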