Gene Tmel_0403 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmel_0403 
Symbol 
ID5297660 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermosipho melanesiensis BI429 
KingdomBacteria 
Replicon accessionNC_009616 
Strand
Start bp422659 
End bp423867 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content36% 
IMG OID640768666 
Productrestriction modification system DNA specificity subunit 
Protein accessionYP_001305657 
Protein GI150020303 
COG category[V] Defense mechanisms 
COG ID[COG0732] Restriction endonuclease S subunits 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCGATC GCATCCCGCC CGGCTACAAG AAAACGGAAA TTGGGATAAT TCCGGAAGAT 
TGGAAAATTG GTGAACTTGA AGAAATTGCT GAGGTAATAG ATCCTCATCC CAGTCATCGT
GCTCCTCCTG AAGTTTCAAG AGGGATTCCT TTTGTAGGAA TTGGTGATTT AGATGAAAAT
GGAAATATAA TAAATGATAA TGTTAGAATT GTCCATCCAA AAATATTGGA AGAGCACAAG
AAAAGATATA ATTTGTATGA TAATTTGATA GGTCTTGGAC GTGTTGCTTC AATAGGGAAA
GTTGTTAAAT TAAAGGAAGG CAAATATGCC GTTTCCCCGA CGATGGGGAT AATTAAAAGT
AATTACATAG AATGGAGATA TCTTTATTAT ATATTACAAT CTAAATACGT AATTGAGCAA
TTCAATAAAA TTATGACTGG CTCAACTAGG TCATCTGTAG GAATGATTGT ACTAAGAAAA
TCGAAAATAC CTTACCCTCC AACCATTGAA GAACAACGCG CCATTGCCCG TGTTCTCTCT
GATGTGGACA AGCTAATAGA AAGCCTTGAC AAGCTAATAG AGAAGAAAAA ACTCATCAAA
AAGGGCGCGA TGCAGGAGCT TCTAACAGGC AAAAAACGCC TGCCTGGATT TAAAGGCGAG
TGGGTGAGGA AGAAGTTGGG GGAGGTTGCG GAAATCTACC AGCCTGAAAC TATTTCACAA
AGTCAACTGT CTAATGTTGG TTACAATGTT TATGGTGCTA ATGGAATAAT TGGGAAATAT
CATAAATATA ATCACGAATT TTGGCAAAAT ATAATAACTT GTAGAGGTTC TACTTGTGGA
ATGGTCAATA GAACTACTGA TAAATGTTGG ATAACAGGAA ATGCAATGGT TATAAATGTT
GATAAAAATA AATCTATAGA CAAGTTATTT ATGTTTTACT TATTAAAATT TCAGGATTTT
ACTAAATTAA TTACTGGTTC AGGGCAGCCT CAAATTATTA GAAAACCACT AGTTGAATTT
ATAATTCATT ATCCTTCTGA CATTGAAGAA CAACGCGCCA TCGCCCAAAT CCTCAGTGAT
ATGGATGCAG AGATTGAGGC GCTGGAGAAG AAAAAGGCAA AGTATGAAAT GATAAAAAAG
GGAATGATGC AATTGCTTTT GACAGGAAAA GTTAGACTTA AAGATAGAAT AAAAGAGGTG
TTAAAATGA
 
Protein sequence
MADRIPPGYK KTEIGIIPED WKIGELEEIA EVIDPHPSHR APPEVSRGIP FVGIGDLDEN 
GNIINDNVRI VHPKILEEHK KRYNLYDNLI GLGRVASIGK VVKLKEGKYA VSPTMGIIKS
NYIEWRYLYY ILQSKYVIEQ FNKIMTGSTR SSVGMIVLRK SKIPYPPTIE EQRAIARVLS
DVDKLIESLD KLIEKKKLIK KGAMQELLTG KKRLPGFKGE WVRKKLGEVA EIYQPETISQ
SQLSNVGYNV YGANGIIGKY HKYNHEFWQN IITCRGSTCG MVNRTTDKCW ITGNAMVINV
DKNKSIDKLF MFYLLKFQDF TKLITGSGQP QIIRKPLVEF IIHYPSDIEE QRAIAQILSD
MDAEIEALEK KKAKYEMIKK GMMQLLLTGK VRLKDRIKEV LK