Gene Information |
Locus tag | Tery_1451 |
Symbol | |
ID | 4245765 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | - |
Start bp | 2191817 |
End bp | 2193394 |
Gene Length | 1578 bp |
Protein Length | 525 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 638106604 |
Product | periplasmic binding protein/LacI transcriptional regulator |
Protein accession | YP_721214 |
Protein GI | 113475153 |
COG category | [G] Carbohydrate transport and metabolism; [S] Function unknown |
COG ID | [COG1357] Uncharacterized low-complexity proteins; [COG1879] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
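The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length counts the terminal stop codon; the record does not state either convention explicitly. The function names `gene_length` and `protein_length` are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa of an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record for Tery_1451
length_bp = gene_length(2191817, 2193394)
print(length_bp)                  # 1578
print(protein_length(length_bp))  # 525
```

Both results match the Gene Length (1578 bp) and Protein Length (525 aa) fields of the record.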
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.137512 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGCTAATC TTACCCTTAG TCCTTGTCAA TCTTTATCTT TCACTCAAGG CTCACTTAAG GTTAATAAAT ATTCAATGGA TCTCTCCTAT TCAAATTTAA CAGGAGCAAA TCTCTCAGGT GCTAATCTTG CCGGAATTAA TTTACAGGGG AGTAATCTTC AAGGTGCAAA CTTGGTAAAT GCTAATTTAG AAGGTGCGAA TTTAAAGGAC GTCAATTTAG AAGGTGCTAA TTTAGCACGC GCTAACTTAA AGAAAGCCAT ACTTCAAAAT AGTAATTTAG ATAATAGTAA TTTGTATGGA TCTGACCTTC AAGCAGCTGA TTTCTCTGAA GCTAATCTCG TCAATATGAA AGCTTTATGG GCTAATTTTC ATAATGCTAT TTTCCATCGG GCAAACTTAG AATCAGCGAA TTTTAATCGA GCAAATTTAA GAGGGGCTGA TTTTTATAAA GCTAATCTAG AGAATGCCTC CTTGCGTTTT ACCGATTTTG GTAGCACGAC TAACGTGATC GAAGCTAAAT TAAACCCCAC TAATTTCCGA GAAACTCAGT TAAAAGGTGC TGATCTGTGG GGAGCAAAAA TGTGGTCAAT ATTTCAAATT AAACAGGCTA AAAATTGGCA GGAAACCAAT AGGATGCCTA ATTGGGAACA GCAAATCAAA CAAGCACGCT TACCCCGTTT AAGGATAGCT TTGCTCAAAC CAGAAAATGC TGACAGTATT TCTGATACCT ATGAATTTGG GATGCGTCGT GCTGCTAACC GTCGTGTGGA AATTTGGGGT ATTTCTTATC CCGGTGGTGT GAAGAACGAA GCGAAAATAA TTAGGCAGTT GATTAAGGAT GGCATGGATG GGATTATCTT AACGCCTGAA GATCCTGTTC AGTCCCTTGA TGCCCTGAAA CTGGCAAGGG ATGCTGGAGT AGCTATTACC ACTGTTGATT TCTGCTTTAA TCCTATAGAT GCAGAAGATT TAGCTATTGC TTGCTATAAT ACAAATAGTT TTCAAATGGG CTATGATTCA GGCCAATATA TAGCAGAATG GGCTCAAAAG AATTTACAGT CAAAATCGGT TCAAATTGGT TTAGTTGATG GTGCTGTATA TGATCGCTAC TATCCCTATC TTCAAGGAGT CTTAAAAGCG ATCAACCATT CGGGTATCCC TTTTCAAATT GTTGACTCTG TTAGCATTGC TTTTGGTAGT GATATTATAA AAGTTAAAAA ACTACTTGAA GATAATCCTG ATGTTCAGAT ACTTTGGGGG GGATCAAATA TAGCAACGGA GGTTACAGTT GCAGCAGTAG CAGAATCTGC CTTGAAGAAT AAAGTTAAGG TTTTTGGAAT TTTAGACCTC TCAAGAAATA AAGCTACAAA ATTACTTAAT CCCAATAGTC CTTTACAGTT GATTATTGAG CAGTCTAGTA TTCAGATTGG TTATGAGGCT GTGAAAACAA CAATTTCTGT CTTGAGAAAA GAAATAGACG GAGCAGATTA TCAGGTTTAC CCCGTTAAGC ACCGTCTATT AACTCAAAAT GACCCAGACA TTGTAAGTGA TATACTCAAC GACTCAAGTT TAGAATAA
Protein sequence | MANLTLSPCQ SLSFTQGSLK VNKYSMDLSY SNLTGANLSG ANLAGINLQG SNLQGANLVN ANLEGANLKD VNLEGANLAR ANLKKAILQN SNLDNSNLYG SDLQAADFSE ANLVNMKALW ANFHNAIFHR ANLESANFNR ANLRGADFYK ANLENASLRF TDFGSTTNVI EAKLNPTNFR ETQLKGADLW GAKMWSIFQI KQAKNWQETN RMPNWEQQIK QARLPRLRIA LLKPENADSI SDTYEFGMRR AANRRVEIWG ISYPGGVKNE AKIIRQLIKD GMDGIILTPE DPVQSLDALK LARDAGVAIT TVDFCFNPID AEDLAIACYN TNSFQMGYDS GQYIAEWAQK NLQSKSVQIG LVDGAVYDRY YPYLQGVLKA INHSGIPFQI VDSVSIAFGS DIIKVKKLLE DNPDVQILWG GSNIATEVTV AAVAESALKN KVKVFGILDL SRNKATKLLN PNSPLQLIIE QSSIQIGYEA VKTTISVLRK EIDGADYQVY PVKHRLLTQN DPDIVSDILN DSSLE
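The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched in plain Python. This is a minimal illustration, not the method the database uses. It assumes the gene sequence is listed in coding orientation (it begins with ATG and ends with TAA, even though the gene lies on the minus strand) and relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code, differing only in the permitted start codons. The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the usual NCBI ordering (bases T, C, A, G).
# Table 11 uses the same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove the spaces that separate 10-character blocks and uppercase."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a coding-orientation sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence in the record
prefix = "ATGGCTAATC TTACCCTTAG TCCTTGTCAA"
suffix = "TTAGAATAA"
print(translate(prefix))             # MANLTLSPCQ
print(translate(suffix))             # LE
print(round(gc_content(prefix), 2))  # 0.4
```

The translated prefix and suffix agree with the first ten and last two residues of the listed protein sequence. Passing the full gene sequence to `gc_content` and `translate` would allow the same check against the GC content and Protein Length fields.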