Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_1458 |
Symbol | proX |
ID | 4245772 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | - |
Start bp | 2206086 |
End bp | 2207102 |
Gene Length | 1017 bp |
Protein Length | 338 aa |
Translation table | 11 |
GC content | 32% |
IMG OID | 638106611 |
Product | glycine betaine transporter periplasmic subunit |
Protein accession | YP_721221 |
Protein GI | 113475160 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2113] ABC-type proline/glycine betaine transport systems, periplasmic components |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.104794 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAAAA CACCGCTAAA AATAATCTTA GGCACAATTA TAACTAGCTT ATTAATAATT TTTACCTCCT GTCAACAAGT ACCCAATCCC TCAAGTGTCA TTATTAGTAG TGCTTATAAT AGTGGTTGGA TAGAAGAATT ATTTCAAACT GAAATTGTTA ATATTGGTCT AAAACAATTA GGCTACAAAG TCAATAAACC AAAACAAAGT GATTATCCAG TAATTTATAT TTCTATTGCC AATGGAGACT TAGAATACAG CACAGTTTAC TATCAACCAG GGCATGAAAA ATTCTTCACA AATGCTGGTG GAGAAAAAAA ATTAGAGGGA GTTGGATTTT TAACACCCCC AGGAAAGCAA GGATATCAAA TTGACAAAAG AACGGCAGAT AAATATAATA TTACTAATCT TAAACAATTA AAAGATCCAG AAATAGCCAA ACTTTTTGAT TCAGATAAAA ATGGCAAAGC AAATTTAGTT GGTTGTAATC CTGGTTGGGC TTGTGAGTTA AATATAGACC ATCAACTTGA AAAGTATGAA CTGGAAGATA CAGTAGAACA TAATAGTGGA CAATATACAG TGCTTCTGGC AGATGCTATT ACTCGTTATA AGAAAGGCGA ATCTATTCTC TATTATGCCT ATAATCCCCA CTGGATATCT GCAGTTTTAA AACCAGGGGA AGATGTAGTC TGGTTAACAG TTCCTTTCAC ATCTTTACCA AATAATATGG CAAGTCTAAG TGAAAAAGAT ACGTCAGTAG ATGGAAAAAA TCTTGGTTTT CCAAGAAGTA AACAGAGCAT TGTGGCTAAT AAAAAGTTTA TAGAATCTAA TCCAGTAGCT AAAAAGTGGT TTGAATTAGT AGAAATTCCA ATAGCAGATA TGAATAGGGA AAGTTTACGC ATTAAAGAGG GTGAAGACAA AGAAGAAGAT ATTTTGCGTC ATGCTCAAGA ATGGGTAAAA AATAATCAGG AAAAATATGA TACTTGGTTA AAAATTGCTA GACAAGCATC TAATTAA
|
Protein sequence | MKKTPLKIIL GTIITSLLII FTSCQQVPNP SSVIISSAYN SGWIEELFQT EIVNIGLKQL GYKVNKPKQS DYPVIYISIA NGDLEYSTVY YQPGHEKFFT NAGGEKKLEG VGFLTPPGKQ GYQIDKRTAD KYNITNLKQL KDPEIAKLFD SDKNGKANLV GCNPGWACEL NIDHQLEKYE LEDTVEHNSG QYTVLLADAI TRYKKGESIL YYAYNPHWIS AVLKPGEDVV WLTVPFTSLP NNMASLSEKD TSVDGKNLGF PRSKQSIVAN KKFIESNPVA KKWFELVEIP IADMNRESLR IKEGEDKEED ILRHAQEWVK NNQEKYDTWL KIARQASN
|
| |