Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_2117 |
Symbol | |
ID | 4243953 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 3308007 |
End bp | 3309710 |
Gene Length | 1704 bp |
Protein Length | 567 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 638107224 |
Product | hypothetical protein |
Protein accession | YP_721825 |
Protein GI | 113475764 |
COG category | [R] General function prediction only |
COG ID | [COG4188] Predicted dienelactone hydrolase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.383292 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAAAAG TAGCTCAAAA AAACAACAAC CTTACAAGTA ATTATTATGG TTTGAGCCGT TGGTTAAAAT CTCTAACTCT AGGCATAATT CCTGCTATTT TAACTGCTTT ACCAGTGAAT GCAGCCAAGC ATATTTCTGT TGGTCACGAA GGTTTAAATA CTTTGATTTC AGTATCAGCA TTAGAAAGTT ATGTTCAAAA TGGAGAAATT AACTATGAAC TAGCTAGCTA TATGCTACTT TTAGAAGCTG AAGACGAACA GGAATACCGA GACTTCTTGC AAACTCGCTA CAATTTTCCA CCTCAAATAG TTGCTCAATT TCTTCACTCA CCTATGGGAA AATTATTCCT CAACAGTTTA GGAGAAGTAA TTAAAACTCA TTCAGTTAAT AGTGGTTCAT CCTCTATTGA AAATGGTGCA GAAGCAATTA AAACAGCCTT ATTAAAATCT GCAAATGACT CTGCAGGCTT AAGTATTATA AATTTTGTCC GTCATTTTCC CAGTGATGTT ATGTGGCTAG AAGCAAAAAA AATTGTAGCT GTGACTGAAA AAATTTCTAC TGTAGAACAA GAGACAAAAG CTTTGGTTGA AACAGTAGCA GAACTAACAT TAAAAGAAGC TAATCAAGAA AAAAGAGTAG ATTTCTCGCA ATTACCGGAT ATTAGGAAAA AGGGAAAATA TAGCTTTTCA CAACAAAAAA TTACTCTAGA AGATACCGGA CGTAATCGCA ATTTTTCAGC CCAATTTTAT TTGCCAAATA TATCTTCAGA AGAAACCTCA ATTCCTGTAG TAGTTATTTC TCATGGTTTA GGTTCTAATG GGGTAAACTT CCAGAGCTTG GCCGAACATT TAGCCTCTTA TGGATTTGCT GTTGCCCTAC CTCAACATAG AGGTAGTAAT TATGAATATA TCCAGAAATT CTTAGCAGGT AAAACTCAAG ATATGTTTCA AGGGAATGAG TTTATCGATC GCCCTTTAGA TATTTCTTTC TTATTAAATG AATTAGAACA GTTAAATAAA TATCAATTAC AAGGTCAGTT AAACTTAGAA AAAGTTGGCA TATTTGGTCA TTCTTTTGGA GCCTCTACGG CCTTTGCCTT AGCAGGTGCA GAGATTAACT TTAATCAGCT CCAACAAGAC TGTGGTCCAC AAATGGAAAT TTTGAATATG TCGCTGCTGC TACAGTGCCG TGCTTTGGAA TTGAAACCAC AAAAATATAA CTTGAAAGAT GATAGAATTG GAGCAATATT TGTACTTGAC CCAGTGAATA GTAGTTTATT TGGAAAAGCT GGTATCAGTC AGATTAAGTT ACCTGTTTTA TGGGGAAGTG GGAGTGAAGA TAAAATTACT CCCATAGTTT TAGAACAAGC AAACTCTTTT ACTTGGTTGC CAACTACCAA TAAATATTTG GTATTAACAG AGGGAGCAGA TCATATTAAT ATTAATCTGG GTGCAGTTAA CAAAAATGCC TTTACTTCTT TAGAAGAGTT AATTAAGCCA GACCCAGAAG TAGTAGTTGG TTATGCTAAT GCTTTTGGTT TAGCATTTTT CCAAAGCCAC GTTGCCGATC GCCCTGAATA TCATTCTTAT TTACAAGCAT CTTATGCTAA GGCGATCAGC GAGCGACCTT TTAATCTGAG TTTAGTGCGT TTTTTGCCAG AAACTCAGTT AAACAAAGTA TCAAAACAAG CAAGGAAAGA CTAA
|
Protein sequence | MEKVAQKNNN LTSNYYGLSR WLKSLTLGII PAILTALPVN AAKHISVGHE GLNTLISVSA LESYVQNGEI NYELASYMLL LEAEDEQEYR DFLQTRYNFP PQIVAQFLHS PMGKLFLNSL GEVIKTHSVN SGSSSIENGA EAIKTALLKS ANDSAGLSII NFVRHFPSDV MWLEAKKIVA VTEKISTVEQ ETKALVETVA ELTLKEANQE KRVDFSQLPD IRKKGKYSFS QQKITLEDTG RNRNFSAQFY LPNISSEETS IPVVVISHGL GSNGVNFQSL AEHLASYGFA VALPQHRGSN YEYIQKFLAG KTQDMFQGNE FIDRPLDISF LLNELEQLNK YQLQGQLNLE KVGIFGHSFG ASTAFALAGA EINFNQLQQD CGPQMEILNM SLLLQCRALE LKPQKYNLKD DRIGAIFVLD PVNSSLFGKA GISQIKLPVL WGSGSEDKIT PIVLEQANSF TWLPTTNKYL VLTEGADHIN INLGAVNKNA FTSLEELIKP DPEVVVGYAN AFGLAFFQSH VADRPEYHSY LQASYAKAIS ERPFNLSLVR FLPETQLNKV SKQARKD
|
| |