Gene Information |
Locus tag | Tery_1998 |
Symbol | |
ID | 4243425 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 3112190 |
End bp | 3113221 |
Gene Length | 1032 bp |
Protein Length | 343 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 638107113 |
Product | extracellular solute-binding protein |
Protein accession | YP_721720 |
Protein GI | 113475659 |
COG category | [E] Amino acid transport and metabolism [T] Signal transduction mechanisms |
COG ID | [COG0834] ABC-type amino acid transport/signal transduction systems, periplasmic component/domain |
TIGRFAM ID | [TIGR01096] lysine-arginine-ornithine-binding periplasmic protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATAATAC CAATTACAGC TTGTGAAGTT GAAGGTACAT CAGACCAAAA AGAAGTTACT CAGGAAAATA AACAAGTCAC TAGTCGTTTA GATATTGTCA AAAATCGTGG TAAACTGATC TGTGGTGTAG AAGGTGGTAT TCCTGGATTC AGTTTTGTAG ACAAGAATGG TAACTACTCA GGAATAGATG TAGATATCTG TAAGGCGGTA GCGGCGGCAT TATTCAATGA TCCAAATCTA GTAGAGTATC GTAACTTAGA TTCAACGGAG CGTTTTACTG CTCTTAACGG TGGGGAAGTA GATATGCTTT CTCGCAACAC AACATGGACT GTTAGTCGAG ATACTACTGT TGGTCTTGAG TTTGCCCCTA CTACATTTTA TGATGGTCAA GGCATGATGG TTCGTGCTAA TAGTGGAGTT GAATCTTTAG AGGACTTGCA AGGCAAATCA ATTTGTGTCG AAGCAGGGAC AACTACAGAA TTAAACTTAA CAGATAATCT CCGCCAACGA AATGTGACAG CTGAGACATT AACGTTTCAA CAAGCAGACC CAGCTTATGC AGCTTATGCT GAAGGGCGTT GTGATGGGAT GACTTCTGAT AAGTCTCAAT TATTGAGTCG TCGTAGTACT CTACCAAGTC CAAATGATCA TGTTATTCTA CAGGTCACTA TGTCCAAAGA GCCTTTAGGT CCAGTCACAA AAAATAATGA TTCTGCTTGG TTTGACGTGG TTAAATGGGT GACTTATGCT TTAATAGAAG CTGAAGAGTT AGGTATTACT CAAGAAAATG TAGATGACTT GAAGCAAAAT TCTGATAACC CTACTATAAG GCGCTTTTTA GGAGTAGATG GAGACCTGGG TGAAGGTTTG GGTTTAAGCA ATGATTTTGC TTATAGAGTA ATTAAGAATG TTGGAAACTA CGCTGAAGTC TATGATCGTA ACTTAGGAGA AGAATCTCAG TTTAAGTTAC CTCGGGGTAT GAATAATTTA TGGACGAAGG GTGGTTTGCT TTATTCTCCT CCATTCCGTT AG
|
Protein sequence | MIIPITACEV EGTSDQKEVT QENKQVTSRL DIVKNRGKLI CGVEGGIPGF SFVDKNGNYS GIDVDICKAV AAALFNDPNL VEYRNLDSTE RFTALNGGEV DMLSRNTTWT VSRDTTVGLE FAPTTFYDGQ GMMVRANSGV ESLEDLQGKS ICVEAGTTTE LNLTDNLRQR NVTAETLTFQ QADPAYAAYA EGRCDGMTSD KSQLLSRRST LPSPNDHVIL QVTMSKEPLG PVTKNNDSAW FDVVKWVTYA LIEAEELGIT QENVDDLKQN SDNPTIRRFL GVDGDLGEGL GLSNDFAYRV IKNVGNYAEV YDRNLGEESQ FKLPRGMNNL WTKGGLLYSP PFR
|
| |
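The coordinate, length, and GC content fields in this record can be cross-checked with a few lines of Python. This is a minimal sketch, not part of the source database: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon that is not counted in the protein length.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases, ignoring the spaces between 10-base blocks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(3112190, 3113221)
print(length)                           # 1032
print(expected_protein_length(length))  # 343
```

Passing the full Gene sequence field to `gc_fraction` should give a value comparable to the GC content field, which is reported as a rounded percentage.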