Gene Tery_2032 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_2032 
Symbol 
ID4243636 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp3159119 
End bp3160816 
Gene Length1698 bp 
Protein Length565 aa 
Translation table11 
GC content31% 
IMG OID638107146 
Producthypothetical protein 
Protein accessionYP_721749 
Protein GI113475688 
COG category[R] General function prediction only 
COG ID[COG4188] Predicted dienelactone hydrolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.347615 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0686681 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACTGC AATCATTAAA AGCAAAATTT ATTTCATTTT CCTTAGGATT AAGCATATTA 
TCACCTATTT TTACTCCTAT TCTTGCTATT GCGGCTGAAC AAATTATAAT TTCTGTTCCC
GATATAAATA TTAAAAATCC AATAGAAGTT TATATCCGAG ATTTTCAAAT TAAAATATTA
GTAAAATCTC TAGAAAACTT GGCTGAAGAA GGTAAACAGA CAGAAGAATT TAATCTTTAT
TCGGAACTAC TAAAAGAAGA AGTATATGAA GTTCTGCAAA AGCCTTTTAA CTTAGATCCT
ACTTTTGTAG AAGCTTTTAT GAAAGAACCA ATGGGAAAAG AAATGCTAAA ACGTTTAGGT
AAAATTATCC AAACAGGAGA CAAGCAAAAT GGTAGTGAAG CTATCAAAAA ATCCTTTCAG
TTAGCTCTTG CAGATCCAAA TGGTTTAACT ATTATTAATT TACTTAATAA ATTTCCTGGA
GATATTTTTA TAGAATTACA AGAAGGTGTG AATTTAGTTG ATGAGTTAAT AAAAAATTTT
GTAGAAAAAA ATAGAATTTT AACAGGATTA CAAAGACAAG CAGAAGATTT AGCATTAGCA
GAAAAAAAAC GTAATTTTAA GTTTTTTCCA GACTTAAGAA TGCCTGGGAA TATAACATTC
AAAAAACAAT CTTTTAGCTT TAGAAATCCC TATCGTAAAC GATCTTCTCC TGTAGATATT
TATCTGCCTA ATATTTCTGG AAATAACTTA ATTCCTGTGG TAGTTATTTC TCATGGTTTA
GGTTCAGATA TAAGTAGTTT TGTCTATCTT GCTAAACATT TAGCATCTCA TGGTTTTGCT
GTAATAGTAC CAAAACATAT TGGTTCAGAT GCAGAAAAAC TTGAAAAAAT GTTTGCAGGT
CTTAGCAGAC CTGTAGATGG GATGACATTT ATTAATCGCC CACTAGATAT TAAGTATTCG
CTAGATGAAA TAGAAGAAAA AATAGGATCC GACCCTATAT GGCAAGGTAA GCTAAATTTT
CAAAACGTAG GGATAATAGG TCAATCTTTT GGTGGTTATA CTGCTTTAAG TGTTGCAGGT
GCTCCTCTCA ACATCAAACA ACTAAGTAAA GACTGCCAAT CTGAAGATAT TCAGTTAATC
TTAAATTTAT CCCTGTTGTT CCAGTGTCAA ATGACTGATT TAGCGAAAGA AGTTACTCCC
AGTTTAGCAG ATGAAAGAAT CAAAGCAGTT ATTGCAATTA ATCCTATTAG TAGTATTTTA
TTTAGAACTG ACAGTCAAGG TATGAGTAAA GATATGAGTG AAATTAAAGT TCCTGTGATG
ATTATTACCG GCACTGAAGA CTTATTTGCT CAGCCAATTC CAGAGCAAAT TTATCCATTT
ATTAGCTTAA CTACCCCAGA AAAATATTTA GTTATAAGTA AACCAGGAAC TCATTTTTCA
TTTATTCAAG AAGAAGAGAA TGTTCCTGTA GAACTACCAA AAAAATTAAT TGGTCCTGAC
CCTAATTTAG CTTATCCATA TCTTCAGGCA TTAAGTGTAG CTTTCTTTCG GGTTTATATT
CAAAATCAAT CTGAGTCATT ACTATATTTA AGTGAGTCTT ATCTACAGTA TCTCAATCAA
GAACCATTTA CTTTTAGTTT ACTGAAATCT CTTACAGAAG TTGATATACA AAAAGTGCTT
GAGAGTTATT ATGAATAA
 
Protein sequence
MKLQSLKAKF ISFSLGLSIL SPIFTPILAI AAEQIIISVP DINIKNPIEV YIRDFQIKIL 
VKSLENLAEE GKQTEEFNLY SELLKEEVYE VLQKPFNLDP TFVEAFMKEP MGKEMLKRLG
KIIQTGDKQN GSEAIKKSFQ LALADPNGLT IINLLNKFPG DIFIELQEGV NLVDELIKNF
VEKNRILTGL QRQAEDLALA EKKRNFKFFP DLRMPGNITF KKQSFSFRNP YRKRSSPVDI
YLPNISGNNL IPVVVISHGL GSDISSFVYL AKHLASHGFA VIVPKHIGSD AEKLEKMFAG
LSRPVDGMTF INRPLDIKYS LDEIEEKIGS DPIWQGKLNF QNVGIIGQSF GGYTALSVAG
APLNIKQLSK DCQSEDIQLI LNLSLLFQCQ MTDLAKEVTP SLADERIKAV IAINPISSIL
FRTDSQGMSK DMSEIKVPVM IITGTEDLFA QPIPEQIYPF ISLTTPEKYL VISKPGTHFS
FIQEEENVPV ELPKKLIGPD PNLAYPYLQA LSVAFFRVYI QNQSESLLYL SESYLQYLNQ
EPFTFSLLKS LTEVDIQKVL ESYYE