Gene Tery_0537 details


Gene Information

Locus tag: Tery_0537
Symbol:
ID: 4244505
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 850299
End bp: 852104
Gene Length: 1806 bp
Protein Length: 601 aa
Translation table: 11
GC content: 31%
IMG OID: 638105848
Product: hypothetical protein
Protein accession: YP_720462
Protein GI: 113474401
COG category:
COG ID:
TIGRFAM ID:
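
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that the start and end positions are 1-based and inclusive and that the last codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(850299, 852104)
print(length, protein_length_aa(length))
```

With the values listed for Tery_0537, both results agree with the Gene Length and Protein Length fields.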


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.148715
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00787083
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGTCAAATA CTTTAATATC CAAATCAAAA ATAACTAGAT TTTTATTATT CTTACATAAA 
AACTTAATAA ATAGAAGCAG GATATTTTGG TTAAGTTTAA CCATAGCAGT TGTATTAATT
TATGGTATTG AATATCTAAA AGCAGCTTTT GAAACTAAAT ATATAATCCA AGATGATGCA
CGACAACATA TATTTTGGAT GCGTCGTTTT TTCGATACAG AATTATTTCC CGAAGACTTA
ATAGCTAACT ATTTTCAGTC AGTAGCACCT TGGGGTTATC AAACTTTTTA TTGGTTAATA
ACATCTCTAG GTATCGACCC AATTTTTTTC GGTAAATTAT TACCAATATT TCTGGGATTA
ATTTCTACAA TTTACTGCTT TGGAATCAGT TTACAAATTC TGCCAATTCC CGCCGTCGGA
TTTTTGAGCT CATTTATTTT AAACCAAAGT TTATGGATGG AAGATGACTT AGTTTCTGCT
ACTCCCCGAG CATTTTTCTA TCCACTTTTT TTAGCATTTT TATATTACTT ACTCCGAGGT
TCCTTATTTC CTGTTTTAGT AGCGATCGCT CTCCAAGCAA TTTTTTATCC CCAGACTGTT
TTACTATCAC TAAGCATCCT AACTATTAGA CTATTCTCCT ATCAGCAAAA GCGGTTAAAA
TTCACCTCAA TTAAACTAAA TTATTTACTT TGGTTAGGAG CAATAATAAC TGCTGCCATA
ATACTTTTAC CATACAAATT AACTGCTACT GAATTCGGAC CAATTATTAC TACTACCGCA
GCCAAAATAC AACCTATTTT CAATTATGCT GATAGTAAAT ATGGCAGAGC ATTCTTTTTT
CATCATAACC CCTTAGTTTT TTGGCTCACA GGACCAAGGA GTGGTATCTT ATTTGTAGGA
TTATTCTCAC CACTGGCTAT AGCTTCATTA CTATTACCTT TTTTACTTAA AAAAGAAAAA
TTTCCTCTAG GAAAACAAGT TTCGGAAAAA GTAGGAATAT TAGTTCAAAT TTTCATAGCA
TCAGTAGGAT TATTTTTTCT AGCTCATATC TTTTTATTTC AGCTTCATTT TCCCAATAGA
TATATTTACC ATAGTATACG AGTGACGATG GCAATAACTG CTGGTATTGC CTTAATAATC
TGGTTAGATA GTTATTTAAA GGCAACTATT TATCAGATAA AAAATAGTTT TACTTGTCTA
CAGGGAATAT ACCTCGGATG TACAACTTTG TTATTAGTAT TATTCAGTAT TATTCCTTTT
TCCACAGACT TAACAATAGA TAATCAATCC TACATTAAAG GCAAAGAAAA AGAATTATAT
GAATACTTAT TAGTACAACC TAAAGATACA TTAATTGCTT CCATATCTAA GGAATCAAAT
AATATTCCTA CCTTTGCTCA ACGTTCGACC TTAGTAGCCC AAGAATATAG CTTACCCTAT
CACGTAGGAT ATTATAATCA ATTTAGTCAA AGAGCCATTG ATTTAATTCA GGCTCAATAT
ACTCCTAACC CAGAACAAGT TAATAATTTT ATTCAAAATT ATGGGGTTGA TTTTTGGTTG
CTCGATCTAA CTGCTTATAA TCCTAGATAT GTAGCTGATA AACAGTTAAT TCGTCAGTAT
AATTTAGCAG ATTCAATTAT TTATCAACTG GAGCAAAATA TGATCCCGGC ATTATCAACA
ACCATGGAAA TTTGTAGTGT ATTAAGCAGT AAGCGAATAA CATTATTATC AACATCATGT
ATTACAAATG AGTTGATAAA ATTGAGTATT AATAATCCTA AAAATTATCA CAAACATAAG
ACTTGA
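
A GC content figure like the one in the Gene Information section can be recomputed from a sequence printed in this blocked layout. This is a minimal sketch, assuming GC content is simply the percentage of G and C bases over all bases and that whitespace between the 10-base blocks should be ignored. The helper name `gc_percent` is hypothetical, and only the first row of the gene sequence is used here for brevity.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace between blocks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(base in "GC" for base in bases)
    return 100.0 * gc_count / len(bases)


# First row of the gene sequence above, copied as printed.
first_row = "ATGTCAAATA CTTTAATATC CAAATCAAAA ATAACTAGAT TTTTATTATT CTTACATAAA"
print(f"{gc_percent(first_row):.1f}%")
```

Passing the full gene sequence instead of `first_row` gives the value for the whole gene.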
 
Protein sequence
MSNTLISKSK ITRFLLFLHK NLINRSRIFW LSLTIAVVLI YGIEYLKAAF ETKYIIQDDA 
RQHIFWMRRF FDTELFPEDL IANYFQSVAP WGYQTFYWLI TSLGIDPIFF GKLLPIFLGL
ISTIYCFGIS LQILPIPAVG FLSSFILNQS LWMEDDLVSA TPRAFFYPLF LAFLYYLLRG
SLFPVLVAIA LQAIFYPQTV LLSLSILTIR LFSYQQKRLK FTSIKLNYLL WLGAIITAAI
ILLPYKLTAT EFGPIITTTA AKIQPIFNYA DSKYGRAFFF HHNPLVFWLT GPRSGILFVG
LFSPLAIASL LLPFLLKKEK FPLGKQVSEK VGILVQIFIA SVGLFFLAHI FLFQLHFPNR
YIYHSIRVTM AITAGIALII WLDSYLKATI YQIKNSFTCL QGIYLGCTTL LLVLFSIIPF
STDLTIDNQS YIKGKEKELY EYLLVQPKDT LIASISKESN NIPTFAQRST LVAQEYSLPY
HVGYYNQFSQ RAIDLIQAQY TPNPEQVNNF IQNYGVDFWL LDLTAYNPRY VADKQLIRQY
NLADSIIYQL EQNMIPALST TMEICSVLSS KRITLLSTSC ITNELIKLSI NNPKNYHKHK
T
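
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this for the first row of each sequence. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (the two differ in which codons may act as start codons, which does not matter here because the gene begins with ATG). The names `CODON_TABLE` and `translate` are hypothetical helpers, not part of any database tooling.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    bases = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First row of the gene sequence, copied as printed above.
first_row = "ATGTCAAATA CTTTAATATC CAAATCAAAA ATAACTAGAT TTTTATTATT CTTACATAAA"
print(translate(first_row))
```

The printed residues can be compared with the first two blocks of the protein sequence, and the final six bases of the gene (ACTTGA) translate to the terminal T followed by a stop.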