Gene Tery_3582 details

Gene Information

Locus tag: Tery_3582
Symbol:
ID: 4244215
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 5509650
End bp: 5510867
Gene Length: 1218 bp
Protein Length: 405 aa
Translation table: 11
GC content: 31%
IMG OID: 638108547
Product: McrBC 5-methylcytosine restriction system component-like
Protein accession: YP_723136
Protein GI: 113477075
COG category: [V] Defense mechanisms
COG ID: [COG4268] McrBC 5-methylcytosine restriction system component
TIGRFAM ID:
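
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state this, but the numbers agree with that convention) and that the final codon of the unspliced CDS is a stop codon. The function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 5509650
END_BP = 5510867
GENE_LENGTH_BP = 1218
PROTEIN_LENGTH_AA = 405


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))   # 1218, matches Gene Length
print(protein_length(GENE_LENGTH_BP))  # 405, matches Protein Length
```

Both results agree with the reported values: 1218 bp is 406 codons, and dropping the stop codon leaves 405 residues.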


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAATCTT CCCAAATCAT CGAACTCACT GAATATCAAC CTTACCAGTT TTCACCAGAT 
AAAATTGATT ATAGTATTCC TACTAAATTA TGGCAAGAAT ACGATCAAAA AGGTCAAAAA
ATTAAAGTTG AATTTCCTAC TCCCAAAACC AACAACAAAT GGCAATTTAC TTCTCAAGGG
TGGGTGGGTT ATATTCCCAT AACTCTGGAT TTTCACATCA TTCTTAAACC TAAGGTACCA
CTTCATAACC TATTTGGAAT GTTGGAATAT GCCTACAACC TCAGAAGTTT TTGTTTTTTA
GATGGTTTAG TTAACTGTAA TTCTCTACAG GAATTTTACA ACTGTTTAGT TAATATTCTG
GCTCAAAAAA TATTAGAGCG AGGTCGAAAA GGTTTTCATC GTGCTTATCT GCCAAAAACA
GAAAATTTAA CTTATATTCG GGGACGATTA AATATGCGGC AAGTTATGCA CAAACCCTGG
GGTGTTAGTT TAAAATGTGA TTATCAAGAA CATACTGCTA ATATTCCTGA TAATCAAATT
TTGGCTTGGA CTTTGTTTAT CATTAGCCGT AGTAGTTTTT GTTCTGAAAA AGTCGCTGTA
ACTGTAACAA GAGCTTTTCA TATTTTGCAA GGTTTGGTAA CTTTACAACC TTTTAAATCT
AGTGATTGTC TGAATATAAA ATATCATCGT TTGAATGAAG ATTATCAGGT TTTACACGGT
TTATGTCGAT TTTTTTTGGA TAATATTGGA GCTAGTCATC AACAGGGTAA TTACTCAATG
TTACCTTTTT TAATAGATAT GGCTAAACTC TATGAAAAAT TTGTAGCTAA ATGGTTAAAA
TTGCATCTAT CCTCAAATTT AAGAGTTAAA GAACAAGAAA AAGTAGAAAT TGTTGATGAT
AAAATTTATT GTAAAATTGA TTTAGTTATT TATGAAATAA AAACTTGCAA GGTTGTTTAT
ATTCTTGATA CTAAATATAA GTTGGATTGC AGACCATCGA CAGATGATAT TAACCAAGTG
GTAGCTTATG CAACTTATAA AAAATGTCAC GAAGCTATTT TGATTTATCC TCAAAGACTA
ACTAATTATA TTAATCAATT AGTTGGTGAA AGTCAAGTAA GATTGCGTAC TTTGACATTT
GCTATTGACT CTGATTTGGA AAAAGCTGGT CAATCTTTTT TAGAAGAATT AATATCAAAT
CCGGTAGTAT CGTTGTAA
 
Protein sequence
MKSSQIIELT EYQPYQFSPD KIDYSIPTKL WQEYDQKGQK IKVEFPTPKT NNKWQFTSQG 
WVGYIPITLD FHIILKPKVP LHNLFGMLEY AYNLRSFCFL DGLVNCNSLQ EFYNCLVNIL
AQKILERGRK GFHRAYLPKT ENLTYIRGRL NMRQVMHKPW GVSLKCDYQE HTANIPDNQI
LAWTLFIISR SSFCSEKVAV TVTRAFHILQ GLVTLQPFKS SDCLNIKYHR LNEDYQVLHG
LCRFFLDNIG ASHQQGNYSM LPFLIDMAKL YEKFVAKWLK LHLSSNLRVK EQEKVEIVDD
KIYCKIDLVI YEIKTCKVVY ILDTKYKLDC RPSTDDINQV VAYATYKKCH EAILIYPQRL
TNYINQLVGE SQVRLRTLTF AIDSDLEKAG QSFLEELISN PVVSL
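
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is illustrative only and not the method used to generate this record. It assumes the sequence is pasted in the 10-character block layout shown above, and it uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard genetic code (table 11 differs in the set of permitted start codons, which this sketch does not model; this gene begins with ATG). The names `clean` and `translate` are hypothetical helpers.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks of the blocked layout and upper-case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1; stop codons are rendered as '*'."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First 30 nt and final 18 nt of the gene sequence above.
print(translate("ATGAAATCTT CCCAAATCAT CGAACTCACT"))  # MKSSQIIELT
print(translate("CCGGTAGTAT CGTTGTAA"))               # PVVSL*
```

The two fragments translate to the first ten and last five residues of the listed protein, followed by the stop codon, which the protein sequence omits.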