Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_2076 |
Symbol | |
ID | 4245724 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | - |
Start bp | 3241214 |
End bp | 3243199 |
Gene Length | 1986 bp |
Protein Length | 661 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 638107187 |
Product | protein of unknown function DUF900, hydrolase-like |
Protein accession | YP_721790 |
Protein GI | 113475729 |
COG category | [S] Function unknown |
COG ID | [COG4782] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.445606 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTATATA ATTTCAGACT CAGAAAAGTC ATTCGGGAAT TACTTAATCA AGATCTACCC GAAGAAGAAT TTAACGACTT AGTTTATGAT TATTTTCCTG ATGTTTATAA CCAATTTACA AATGGACAAA ATAAAAAGCA AAGAGTCAGA ATTTTAATTG AATATGCTGA CAAACATAGA GAAATTGAGC GGTTACTTGA AGGCATTAAA AATATTAATC CAAAAGTTTA TCAAGAGTAT GAGTCAAAAT TAGGAGAAAA TCCCCCTCCG CCTCCAATTG AAAAATGTGA TGTTTTGGTT TTAGCAGCAA ACCCTACAAC TACACAGCCA CTACAATTAA AAAAAGAAAC TGAATTAATT AGGGAAAAAC TACAGCAGAC AGAATTTGGA AAAAATTATA TTGTTTATGG AGAAGAAAAT GCTTTTATAG AAGATTTATC TCAATATTTG CTGAAATATG AGCCTAGAAT TCTTCACTTT AGCGGTCATG GTAATTCTCA AGGTGAAATA ATTTTAAACA ACCGTCAAGG TGAGGCAGAG GTTTTATCCC TCGAAACATT ATCAGAATTA TTATCTATTG TTAGAAAAGA TGGAAAACCT ATAGAATGTG TTGTATTTAA TGCCTGTTTT TCTCTGAAAA AAGCTGATGC AGTCGCTCAC CAGGTAGGTT GTGTTATTGG CATGAAAAAA GAGATTGGTG ATGATTCTGC TTTGATATTT GCCGAAGAAT TTTATCAAGG TTTAGCATAT CAAAGGAGCT ATTATCAAGC TTTTCAACTA GGTATAAATG GAATTGAACG CTTAAGATTA CCTGATAGTC CAATTCCTCA TTTTATTCCT TTTGATACAT CATTATTAGA GTCAGAAACT GTCAGTTTAA GAAGTCATCA AACCAACGGT TATTTGACTT CAAAAGAAGC CGTAACAAAG AAAGCAATAA AGAAAAAAGA AACCGTAAAA GTCAAAAGAT CTCTGATTTT AAAAGATACT AAAGAAACAA CAGCAACTAT ATATCCTTTA TGGTTTGGTA CCAACAGAAA ACCTGTAGAT ACAAATAATA TATCCAAAGG TTTTTCCGGA AAAAGAGATG ACAAACTTCA CTATGGTATT TGTCAAGTAG CTGTTCCTAA ATCTCATAAA ATAGGCTCTA TAGGTTCCCC TTTGTGGAAA AGATTAATTA CTTTCAAAGA CGATCGCCTC AAACTACATT TTCAAAGTTT GCAAATTCTG GAAAAAGAAC TATTTTGGGA AAATATCAAC GAAGAATTAA AAGACCATGA AATAAATGAA AGGTCTGCTT TAGTCTTTGT TCATGGATAC AACGTCAATT TTGAAGATGC AGCTATTAGA GCCGCACAAA TGGGGTTTGA CCTGCAAGTG CCAGGAATTA CAGCCTTTTA TAGTTGGCCA TCTCAAGGGA AATTATCAGC ATATCCGGTA GACGAGGCAA GTATTGAAGC CAGCGAAAAG TACATGACAG AATTTTTACT CAACCTAGCC GAAAAAACGG ACATTGAGAA AATTCATATT ATTGCTCATA GTATGGGAAA CCGAGGTTTA CTCAGAGCAG TCCAAAGAAT TATTTCTCAA GTTCAAACAA TAACTAATAT TGCTTTTGGG CAAATTATTT TAGCCGCTCC AGATGTAGAT ATTGACTTGT TTAAAGAGTT AGCTAAAGGA TATCATCAAT TAGCAGAACG AACTACATTA TACATATCAT CAAAAGACAA AGCCTTAGCA ACTTCAGCGC TTATTCATCA GCATGGCCGA GCTGGTTTTT TCCCCCCTGT TACTGTTGTA GAAGGAATAG ACACGGTAAA AGTTTCTAAG ATAGATTTAA CTTTATTAGG ACATGGTTAT TTTGCTGATG CTCGTTTGGT ACTTGAAGAT ATACGGGACT TATTAATTAA TAATACTTCC CCAGGGCAGC GAAGAGGTCG GTTAGAACCG TCGGAAGAGG GGGGTTATTG GATTATGCGG CAGTAA
|
Protein sequence | MLYNFRLRKV IRELLNQDLP EEEFNDLVYD YFPDVYNQFT NGQNKKQRVR ILIEYADKHR EIERLLEGIK NINPKVYQEY ESKLGENPPP PPIEKCDVLV LAANPTTTQP LQLKKETELI REKLQQTEFG KNYIVYGEEN AFIEDLSQYL LKYEPRILHF SGHGNSQGEI ILNNRQGEAE VLSLETLSEL LSIVRKDGKP IECVVFNACF SLKKADAVAH QVGCVIGMKK EIGDDSALIF AEEFYQGLAY QRSYYQAFQL GINGIERLRL PDSPIPHFIP FDTSLLESET VSLRSHQTNG YLTSKEAVTK KAIKKKETVK VKRSLILKDT KETTATIYPL WFGTNRKPVD TNNISKGFSG KRDDKLHYGI CQVAVPKSHK IGSIGSPLWK RLITFKDDRL KLHFQSLQIL EKELFWENIN EELKDHEINE RSALVFVHGY NVNFEDAAIR AAQMGFDLQV PGITAFYSWP SQGKLSAYPV DEASIEASEK YMTEFLLNLA EKTDIEKIHI IAHSMGNRGL LRAVQRIISQ VQTITNIAFG QIILAAPDVD IDLFKELAKG YHQLAERTTL YISSKDKALA TSALIHQHGR AGFFPPVTVV EGIDTVKVSK IDLTLLGHGY FADARLVLED IRDLLINNTS PGQRRGRLEP SEEGGYWIMR Q
|
| |