Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_4514 |
Symbol | |
ID | 4246168 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | - |
Start bp | 6965009 |
End bp | 6965869 |
Gene Length | 861 bp |
Protein Length | 286 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 638109396 |
Product | extracellular solute-binding protein |
Protein accession | YP_723972 |
Protein GI | 113477911 |
COG category | [E] Amino acid transport and metabolism [T] Signal transduction mechanisms |
COG ID | [COG0834] ABC-type amino acid transport/signal transduction systems, periplasmic component/domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0322747 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACAAGA AACTAAAAAT TATTATTGCT TCATCCCTAG CATTCTTGGC GGCTATTTTT CCTCAAAAAA TTAGTCATGC ACAAACACAA GAAACAATCA TTAGAAGAGA GTTAATAGTA GTAGGAGTTA GGGGAGATTC CCCTTGGTTT GGCTTCAGAA ATGATGGAAA ATGGACGGGA TATTGTATAG ATATTGCCTA TGCTTTAGCG GACCGCCTTA GTGCCAACCA GCTCGATCCT ATCAGGGTAA GATTAGTTAC ATCAACGACC CAAAGTCGTT GGGATTTGGT AACAAGTGGT AGGGTAGATT TAGAATGCGG TCCTAATTCT ATTAGTGCTG AGCGTGAAGC CAAGCATGGT ATTAATTTTT CTTCACCTTT TTTTATCACA GCAACACAAA TTTTAGCTAA AAAAGGTAGA CAAGAAGAAG ATGTGAAAGT GGGTAATGCC ACAGTTGGTG TAGTTAGGAA TACTACTAAT GAAAGCGATC TAAAGAGTGC TTATCCATTA GAAAAAATTG ATAATAAATA TTTAAGTAGA GAAGAAGGTG TTCAAGCTGT GATGGATGGT GAAGTTAGTG GTTTTGCTAG TGATGGAACT CTTTTGTTAG GTTCAGCGGA GATGATGAAC TTGGATATAG AAAATGATTA TACTTTTATT ACTATTAATC AAGAAAATGG CCGTCCTTTT TGTGCTGGTT ATGGCTTCAT ACTTCCAGGA GGGGAAGATA ATTCTAGTTG GCAAAGGTAT ATTAATGACT TTCTTGCTCA TAATAAAAAA GCCCAAAGAA TTAGAAAAAA ATGGTTGGGT AATTTAAGCG AGAATAGTCA AAGAATTTAT GATGCTTGTA CTCAGAAATA G
|
Protein sequence | MNKKLKIIIA SSLAFLAAIF PQKISHAQTQ ETIIRRELIV VGVRGDSPWF GFRNDGKWTG YCIDIAYALA DRLSANQLDP IRVRLVTSTT QSRWDLVTSG RVDLECGPNS ISAEREAKHG INFSSPFFIT ATQILAKKGR QEEDVKVGNA TVGVVRNTTN ESDLKSAYPL EKIDNKYLSR EEGVQAVMDG EVSGFASDGT LLLGSAEMMN LDIENDYTFI TINQENGRPF CAGYGFILPG GEDNSSWQRY INDFLAHNKK AQRIRKKWLG NLSENSQRIY DACTQK
|
| |