Gene Information |
Locus tag | Tery_1066 |
Symbol | |
ID | 4241951 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 1667703 |
End bp | 1670750 |
Gene Length | 3048 bp |
Protein Length | 1015 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 638106296 |
Product | hypothetical protein |
Protein accession | YP_720908 |
Protein GI | 113474847 |
COG category | |
COG ID | |
TIGRFAM ID | |
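
Note: the reported lengths follow from the coordinates above. The minimal sketch below checks them, assuming that Start bp and End bp are 1-based inclusive positions and that the unspliced CDS ends in a single stop codon that is not translated. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the Gene Information table for Tery_1066.
length_bp = gene_length(1667703, 1670750)
print(length_bp)                  # 3048
print(protein_length(length_bp))  # 1015
```

Both results agree with the Gene Length (3048 bp) and Protein Length (1015 aa) fields in the record.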
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.619549 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCATTATT CGATGAGCCC TAATACTGAA CCACAAATTA GTTATTTCCG AGAATGGACT AATAGCTGTG TTGATGACCA ATTAATCCAC CTTAACGTTG TCCCATTAGA AGGACAACGA GCTTATGAAT TTTTATTTTA TTCTGATGCT ATTCCTCGAC GAAATGATGG TCGAGTCACA AGCGAAATAT TAAACCGATA CCAACATATT GAAGAAGGTG GGTGGTGGTG TTCTGGAATT GATTTATTAT CAGGAGAAGA AGATTTTTGG GGTTGTTTTA AACCTAGTCA ACCACGTCAT AGTTACGACG AAAAAAAAAT AATTAAATAT GAGCACCCTC CCAAAACTCC TACTAGTTTA TTTGCTCTAA AAATTCCCCT ACATTTATGG CATAAAATAG CGAGTCATTA TCAATTAACA ATTTTGCCAG AAGATATTGA TAATAATCAA CCAGACTTTG GTTTTTGGCA GTGGTTTATC GCCCATCCTC AAATACCTTT ATGTATTACT GAAGGGGCAA AAAAAGCAGG AGCTTTATTA ACAGCATCCT ATGTAGCTAT TGCCTTACCA GGAGTATTTG GTGGATACCG AGTTTTGAGA GATGAATATG GTAACCGTAT TGGTAAACAA CATTTAATTC CCCAGTTAGA AAAGCTGATT AATAATACTC GAGAAATTTA TATTGCATTT GACCAAGATA CTAAAGCTAA AACTATTAAA AATGTTAATG CTGCCATTAG AAAAACTGGA TATTTACTCA TAAAAAAGGG ATGTAAAGTT AAAGTAATTT CCTGGAATCC AGAATTAGGT AAAGGAGTAG ATAATTTAAT CGCTAATCAT GGAAAAAATG TTTTTGATGA AGCTTATAAA AAGGCATTAC CTTTAGAACT TTGGAAAGCT AAATCATTTA TTCGTCTCAC ATATCCGGTA AATTTGAGAG TTAATAGTCG CTATCTTTCA GAACAAAATA TATTTAATTC TATAGACAAT AATAACAATA ACTTACATAA ATTAGACGAC ATTGATTTAG ATTATAGTAT AAATTTTCCC GCTAAACTTA TAGGAATAAA ATCTGCTAAA GGAACTGGAA AAACTAAGTT TTTAGAAAAA ATAGTTTCTG AAGCTGTAGC TCGTAATCAA AAAGTTTTAG TTATCGGGCA TCGGGTACAA TTAGTACAGG AATTATGTCA ACGTTTTGGA TTAAAATATA TTACAGAAGT TAACTCAAAA TCCCCAGATA AATTATTAGG TTTAGGGTTA TGTATTGACT CTTTACATCC TAATTCCCAA GCTAACTTTA ATCCAGAAAC TTGGTCAGAT GGAATAGTAA TTATTGATGA AATTGAGCAA GTAATTTGGC ATGCTTTAAA TTCTAATACT TGTAGGAAAA GTAGAGTAAA AATTCTCAGA TGTTTTAAAG CGTTAATGCA AAATATTTTA GGGGGTGCAG GTAAAGTATT TATAGCTGAT GCTGACCTCA GCGATATTTC TATAGATTAT TTACAAGCTT TAGCAGGAGT AAAATTAGAA CCTTTCATTG TTCAAAATGA TTGGTTACCT GGAGAAAAAG AAGCTTGGAA AATTTTTAAT TACCCAGAAA CAACTCCCAA AAGATTAATA GCAGATTTAC AAAAACATAT TCGTGAAGGT GGCAAACCAA TAGTTTGTTT ATCAGCGCAA AAACTGACAA GTAAATGGGG GACTCGTGCC TTAGAAGCTT ATCTGAAAAA ACAGTTTCCC AAATTAAAAA TTCTGAGAAT AGATTCAGAA TCTTTAGCAG AAGTAAATCA TCCTGCTTAT 
GGTTGTATTA AGTCATTAAA TCAAGTATTA CTAAAGTATG ATATTGTTTT AGCTTCTCCC TCAATTGAAA CTGGAGTTAG TATTGATGTT CAAGGGCATT TTACTTCAGT TTGGGCAATT GCTCAGGGGG TGCAGGGAGC CACTTCTGTT TGTCAATCTT TGGGTAGAGT TCGTGAGAAT ATTCCTAGAT ATTTGTGGGT AGCTAATTGT GGTTTTAATC AGGTAGGTAA TGGTTCTACT TCTATAACTT CTTTGCTCAA TTCTGAGCAA CGTTTGACTC AATTAAATAT TCGGTTATTA CAACAATCTG ATTTTGATAG TTTAGATGAT TTGGAAGTAG GTTTTCAGGC AGAATCTTTT TTGTGTTGGG CAAAAATGGC GGTCCGCTTT AATGCTGGAA TGAATCAATA TAGAGAGTCA GTTTTAGAAT TTTTACGAAT AGAAGGACAG CAGATCATAG AAGTTTCTGC AGAAGCTTTA CCTGAAAATT TAGAAGAAAA AAAATCATCA GAAACTCCTG AAGAAATTAA TACTTTGCAG GAAGCTATAG CTATTGTAAT TAAGCAAAAC TATCAGACAG AATGTGAGGC GATCGCTACT GCGAAAAGTA TTAGTTTATT TGAGTATCAA AAACTCACAA AAAGATTATC AAAACCAATT CAACAACAAC GAGAACAACG TAAATTTGAG TTGATGTTAC GTTATAGTAT TCCCATAACT GCTGAATTAG TTCAGAAAGA TGATCGGGGG TGGTACCAAC AGTTGCAATT ACATTATTTT ATGACAGTAG GGCGACCGTA TTTACCAGCC CGGGATGGGG AAGTAGTAAA AAAATTGTTG GAGTTAGGTA AAGGTAATAT TTTTATTCCT GATTTTAATG ACTCTTTATT AGGTGCAATT ATTGGAGTAA TAGAATTATT AAAAATACCT TCATTATTGA AGGATAAAAA ACGAGAGTTG AAAAATATAG ACCCAGATTT ACAATTATTA GGAAAAACAG CTTTGTCAAA CCGAACAGAA ATTAAAACTA TATTAGGAAT AGGATTAGCT GCTAACTCTA GCCCCATTAT AATTGTTAGG CGTTTTTTGG AGAAAATTGG CTATAGTTTG GAATGTTTAC GAACAGAAAC TCACCATAAA AAACGATTGC GAATTTATCG AATTTTTCAT CCTGATGATG GTAGGTTTGA AGTATTTCAG CAATGGTTGA GCTCAAGCCA TAACTCAAAG GTCAGAACTT GTGTATAA
|
Protein sequence | MHYSMSPNTE PQISYFREWT NSCVDDQLIH LNVVPLEGQR AYEFLFYSDA IPRRNDGRVT SEILNRYQHI EEGGWWCSGI DLLSGEEDFW GCFKPSQPRH SYDEKKIIKY EHPPKTPTSL FALKIPLHLW HKIASHYQLT ILPEDIDNNQ PDFGFWQWFI AHPQIPLCIT EGAKKAGALL TASYVAIALP GVFGGYRVLR DEYGNRIGKQ HLIPQLEKLI NNTREIYIAF DQDTKAKTIK NVNAAIRKTG YLLIKKGCKV KVISWNPELG KGVDNLIANH GKNVFDEAYK KALPLELWKA KSFIRLTYPV NLRVNSRYLS EQNIFNSIDN NNNNLHKLDD IDLDYSINFP AKLIGIKSAK GTGKTKFLEK IVSEAVARNQ KVLVIGHRVQ LVQELCQRFG LKYITEVNSK SPDKLLGLGL CIDSLHPNSQ ANFNPETWSD GIVIIDEIEQ VIWHALNSNT CRKSRVKILR CFKALMQNIL GGAGKVFIAD ADLSDISIDY LQALAGVKLE PFIVQNDWLP GEKEAWKIFN YPETTPKRLI ADLQKHIREG GKPIVCLSAQ KLTSKWGTRA LEAYLKKQFP KLKILRIDSE SLAEVNHPAY GCIKSLNQVL LKYDIVLASP SIETGVSIDV QGHFTSVWAI AQGVQGATSV CQSLGRVREN IPRYLWVANC GFNQVGNGST SITSLLNSEQ RLTQLNIRLL QQSDFDSLDD LEVGFQAESF LCWAKMAVRF NAGMNQYRES VLEFLRIEGQ QIIEVSAEAL PENLEEKKSS ETPEEINTLQ EAIAIVIKQN YQTECEAIAT AKSISLFEYQ KLTKRLSKPI QQQREQRKFE LMLRYSIPIT AELVQKDDRG WYQQLQLHYF MTVGRPYLPA RDGEVVKKLL ELGKGNIFIP DFNDSLLGAI IGVIELLKIP SLLKDKKREL KNIDPDLQLL GKTALSNRTE IKTILGIGLA ANSSPIIIVR RFLEKIGYSL ECLRTETHHK KRLRIYRIFH PDDGRFEVFQ QWLSSSHNSK VRTCV