Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_2032 |
Symbol | |
ID | 4243636 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 3159119 |
End bp | 3160816 |
Gene Length | 1698 bp |
Protein Length | 565 aa |
Translation table | 11 |
GC content | 31% |
IMG OID | 638107146 |
Product | hypothetical protein |
Protein accession | YP_721749 |
Protein GI | 113475688 |
COG category | [R] General function prediction only |
COG ID | [COG4188] Predicted dienelactone hydrolase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.347615 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.0686681 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACTGC AATCATTAAA AGCAAAATTT ATTTCATTTT CCTTAGGATT AAGCATATTA TCACCTATTT TTACTCCTAT TCTTGCTATT GCGGCTGAAC AAATTATAAT TTCTGTTCCC GATATAAATA TTAAAAATCC AATAGAAGTT TATATCCGAG ATTTTCAAAT TAAAATATTA GTAAAATCTC TAGAAAACTT GGCTGAAGAA GGTAAACAGA CAGAAGAATT TAATCTTTAT TCGGAACTAC TAAAAGAAGA AGTATATGAA GTTCTGCAAA AGCCTTTTAA CTTAGATCCT ACTTTTGTAG AAGCTTTTAT GAAAGAACCA ATGGGAAAAG AAATGCTAAA ACGTTTAGGT AAAATTATCC AAACAGGAGA CAAGCAAAAT GGTAGTGAAG CTATCAAAAA ATCCTTTCAG TTAGCTCTTG CAGATCCAAA TGGTTTAACT ATTATTAATT TACTTAATAA ATTTCCTGGA GATATTTTTA TAGAATTACA AGAAGGTGTG AATTTAGTTG ATGAGTTAAT AAAAAATTTT GTAGAAAAAA ATAGAATTTT AACAGGATTA CAAAGACAAG CAGAAGATTT AGCATTAGCA GAAAAAAAAC GTAATTTTAA GTTTTTTCCA GACTTAAGAA TGCCTGGGAA TATAACATTC AAAAAACAAT CTTTTAGCTT TAGAAATCCC TATCGTAAAC GATCTTCTCC TGTAGATATT TATCTGCCTA ATATTTCTGG AAATAACTTA ATTCCTGTGG TAGTTATTTC TCATGGTTTA GGTTCAGATA TAAGTAGTTT TGTCTATCTT GCTAAACATT TAGCATCTCA TGGTTTTGCT GTAATAGTAC CAAAACATAT TGGTTCAGAT GCAGAAAAAC TTGAAAAAAT GTTTGCAGGT CTTAGCAGAC CTGTAGATGG GATGACATTT ATTAATCGCC CACTAGATAT TAAGTATTCG CTAGATGAAA TAGAAGAAAA AATAGGATCC GACCCTATAT GGCAAGGTAA GCTAAATTTT CAAAACGTAG GGATAATAGG TCAATCTTTT GGTGGTTATA CTGCTTTAAG TGTTGCAGGT GCTCCTCTCA ACATCAAACA ACTAAGTAAA GACTGCCAAT CTGAAGATAT TCAGTTAATC TTAAATTTAT CCCTGTTGTT CCAGTGTCAA ATGACTGATT TAGCGAAAGA AGTTACTCCC AGTTTAGCAG ATGAAAGAAT CAAAGCAGTT ATTGCAATTA ATCCTATTAG TAGTATTTTA TTTAGAACTG ACAGTCAAGG TATGAGTAAA GATATGAGTG AAATTAAAGT TCCTGTGATG ATTATTACCG GCACTGAAGA CTTATTTGCT CAGCCAATTC CAGAGCAAAT TTATCCATTT ATTAGCTTAA CTACCCCAGA AAAATATTTA GTTATAAGTA AACCAGGAAC TCATTTTTCA TTTATTCAAG AAGAAGAGAA TGTTCCTGTA GAACTACCAA AAAAATTAAT TGGTCCTGAC CCTAATTTAG CTTATCCATA TCTTCAGGCA TTAAGTGTAG CTTTCTTTCG GGTTTATATT CAAAATCAAT CTGAGTCATT ACTATATTTA AGTGAGTCTT ATCTACAGTA TCTCAATCAA GAACCATTTA CTTTTAGTTT ACTGAAATCT CTTACAGAAG TTGATATACA AAAAGTGCTT GAGAGTTATT ATGAATAA
|
Protein sequence | MKLQSLKAKF ISFSLGLSIL SPIFTPILAI AAEQIIISVP DINIKNPIEV YIRDFQIKIL VKSLENLAEE GKQTEEFNLY SELLKEEVYE VLQKPFNLDP TFVEAFMKEP MGKEMLKRLG KIIQTGDKQN GSEAIKKSFQ LALADPNGLT IINLLNKFPG DIFIELQEGV NLVDELIKNF VEKNRILTGL QRQAEDLALA EKKRNFKFFP DLRMPGNITF KKQSFSFRNP YRKRSSPVDI YLPNISGNNL IPVVVISHGL GSDISSFVYL AKHLASHGFA VIVPKHIGSD AEKLEKMFAG LSRPVDGMTF INRPLDIKYS LDEIEEKIGS DPIWQGKLNF QNVGIIGQSF GGYTALSVAG APLNIKQLSK DCQSEDIQLI LNLSLLFQCQ MTDLAKEVTP SLADERIKAV IAINPISSIL FRTDSQGMSK DMSEIKVPVM IITGTEDLFA QPIPEQIYPF ISLTTPEKYL VISKPGTHFS FIQEEENVPV ELPKKLIGPD PNLAYPYLQA LSVAFFRVYI QNQSESLLYL SESYLQYLNQ EPFTFSLLKS LTEVDIQKVL ESYYE
|
| |