Gene Information |
Locus tag | Tery_1459 |
Symbol | proX |
ID | 4245773 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | - |
Start bp | 2207730 |
End bp | 2208740 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 31% |
IMG OID | 638106612 |
Product | glycine betaine transporter periplasmic subunit |
Protein accession | YP_721222 |
Protein GI | 113475161 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2113] ABC-type proline/glycine betaine transport systems, periplasmic components |
TIGRFAM ID | |
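The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS is unspliced and ends in a single stop codon; both assumptions are consistent with the values in this record but are not stated by it. The function names are hypothetical and used only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count, assuming an unspliced CDS that includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(2207730, 2208740)
print(length)                     # 1011
print(protein_length_aa(length))  # 336
```

Both results agree with the Gene Length and Protein Length fields of the record.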
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0408344 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAAACAAA AAAACCTAAA AATTATCTTA GGAACAATTT TAACGAGCAT ATTAATAATT TTTACATCTT GTCAACAAGT TCCTAATTCT CAAACGGTAA CTATTCGGAC TGCCCATAGT ACTTGGATAG AAGAGTCATT TCAAACTAAT ATTGTTAATA TTGGTCTAGA AAAACTCGGG TATAAGGTCG AAGAACCAAA ACAAATTGAA TACACAGCTA TCTATATTTC TATTGCTAAT GGAGACTTAG AATATAGTAC TATTTACTAC CAACCATCTC ATGAAAAATT CTTTGAAAAA GCTGGTGGTG AAGAAAAGTT AGAGGCAGTT GGAATTTTGA CACCTGATGG GATAGCTGGA TATCAAATAG ACAAAAAAAC AGCGGATAAA TATCAAATTA CTAACATTAA AGAATTAAAA AATCCAGAAA TAGCCAAGCT TTTTGACTGG GATAAAAACG GTAAGGCAAA TTTAGTTGGT TGTAATCCTG GTTGGGCTTG TGAGTTAAAT ATAGACCATC ACATCGAAGC TTATGGACTA GAAAATACAG TAGAACATAG TCGAGGTCAA TATAATATTC TATTAGTAGA TGCTATAACT CGCTATAAAC AAGGGCAACC TATTCTTTAT TTTGCCTATA ATCCTCATTG GATATCTGCT ATCTTAAAAC CAGGTAAAGA TGTAGTTTGG TTAGAAGTGC CTTTTACCTC TTTACCAGAT ATGAGAAATA TCACAAAAAA AGATACATTA CTAGATGGAA AAAATATTGG TTTTTCTAGA ACTCAACAGA GCATTGTGGC TAATAAAAAA TTTTTAGAGT CTAACCCAGT GGCTAAAAGG TGGTTTGAAT TAGTAGAAAT TCCAGTTGCA GATATGAATA CTGAAAGTTT ACGAATTAAA GAAGGTGAAG ACAAAGAAGA AGATATTTTA CGTCATGCTC AAGAGTGGAT AAAAAATAAT CAAGAAAAAT ATAATAATTG GTTAGAAATA GCAAAAAAAG CAGCTAATTA A
Protein sequence | MKQKNLKIIL GTILTSILII FTSCQQVPNS QTVTIRTAHS TWIEESFQTN IVNIGLEKLG YKVEEPKQIE YTAIYISIAN GDLEYSTIYY QPSHEKFFEK AGGEEKLEAV GILTPDGIAG YQIDKKTADK YQITNIKELK NPEIAKLFDW DKNGKANLVG CNPGWACELN IDHHIEAYGL ENTVEHSRGQ YNILLVDAIT RYKQGQPILY FAYNPHWISA ILKPGKDVVW LEVPFTSLPD MRNITKKDTL LDGKNIGFSR TQQSIVANKK FLESNPVAKR WFELVEIPVA DMNTESLRIK EGEDKEEDIL RHAQEWIKNN QEKYNNWLEI AKKAAN
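The sequences above are displayed in space-separated groups of ten characters, so they need to be cleaned before any computation. The following is a minimal sketch, not the database's own tooling: it strips the grouping, computes GC percentage, and translates the displayed coding-strand sequence. The helper names are hypothetical. The codon table is the standard one; NCBI translation table 11 assigns the same amino acids and differs only in which codons may act as start codons. Only the first three groups of the gene sequence are used here as sample input.

```python
BASES = "TCAG"
# Standard genetic code in NCBI order (first base T, C, A, G; then second; then third).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(raw: str) -> str:
    """Remove the display grouping (spaces, newlines) and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three groups of the gene sequence in this record (a prefix, not the full gene).
prefix = clean_sequence("ATGAAACAAA AAAACCTAAA AATTATCTTA")
print(len(prefix))        # 30
print(translate(prefix))  # MKQKNLKIIL
print(round(gc_percent(prefix), 1))
```

The translated prefix matches the first ten residues of the protein sequence in the record, which suggests the gene sequence is shown in coding orientation even though the gene lies on the minus strand of the replicon. Running `gc_percent` on the full cleaned gene sequence gives a value to compare against the GC content field.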