Gene Tfu_0221 details


Gene Information

Locus tag: Tfu_0221
Symbol:
ID: 3578934
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermobifida fusca YX
Kingdom: Bacteria
Replicon accession: NC_007333
Strand:
Start bp: 249369
End bp: 250619
Gene Length: 1251 bp
Protein Length: 416 aa
Translation table: 11
GC content: 71%
IMG OID: 637683905
Product: subtilisin-like serine protease
Protein accession: YP_288282
Protein GI: 72160625
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG1404] Subtilisin-like serine proteases
TIGRFAM ID:
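
The start, end, gene length, and protein length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 249369
end_bp = 250619

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)      # 1251
print(protein_length)   # 416
```

Both values agree with the Gene Length (1251 bp) and Protein Length (416 aa) fields reported in the record.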


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTGCACGA CACGGGGTTG GACTCCCGCC GCGCTGGCTC TCTGCTCGGC GCTCGCTGTC 
GGCCTGTCCG CCGCACCGGC AGCAGCCGAC GTTTCCGACC TGCGTCTTGA ACAGTGGGGC
CTGGACATGG TCGGTGCGGC CGAGGTGTGG GAGGAAGCCC AAGGCTCCGG GGTGACGGTT
GCCGTGCTCG ACACCGGTGT TGTCACCGAC CACCCCGACT TGAAGGACGT CACCGTCGGC
CCCGACTTCA CCGGTCACGA CCTGTCCTCA GACAGCGACG GCTACGGGAT CCACGGCACG
ATGATGGCCG GAATCGTGGC GGCCAGCGGG CACGGTGTCG AACACACCGG CGGTGTGATG
GGAGTGGCCC CCGAAGCAGA GATCCTGGCG ATCCGCATCA CCGCGGAACC GGACGGCCCT
GACCGGGACA GTGTGGACCC GGGGGCCTTG GCCGCAGGCA TCCGCTACGC CGTGGACGAA
GGCGCCCAGG TGATCTGCCT GCCCTTGGCT GGCGCAGAGT TCTCCGTCCA AGCCAACGAA
GCAGACCAGG AGGCCATCGC CTATGCGGTG AACCACGGGG TGGTAGTCGT CGCCTCCGGC
GGTCCCCTCG GGGAGGCCAG CTACCCGGCC GCCTACCCGG GCGTGCTTGC CGTCGGCTCG
GTCGGCCCCG ACGGGTCCCT CTCCGAGTTC TCCAGCCGGG GCGACCACAT CGCGGTGACC
GCCCCGGGCG AGGAGATCAC CGTGGTGGAC CCGGACGGCG GCTACACCAC AGTCTCCGGA
AGCGACGCGG CCGCGGCGTT CGTCGCGGGC GTGGCCGCGC TGATCCGCGG CGAGTTCCCG
CAGTTGAAAC CGGAGCAGGT GGTGGAGGCG ATCACCTCGG GCGCGCAGGC TGCCGACCCT
GCCGCAGCGG GGCAGCCCGG CTACGGAGCC GGAGTGGTGA ACGCGCCGGA CGCGTTCACC
ACGGCCAAGT CGACCGCGGA CCACGTGCCG CCGTTCGACC CGGAGCTGGC CGAGCAGCTC
GAGGAAGAGC CGCTCATCCC CTACTGGATG CTGTGGACGG GCGGGGCTGT GCTCCTCATC
GTCGCGGCCG TGGTGGCAAT GGTGGTCGCC CACCGGCGCG CCGCCGACCC CTACGGTTTC
GGCAAGCGGA AGCCGGAAGA ACCGGAAGAG CCGGAGCCCG TACCGACCGC GCGGCGCCCG
GTGCGGGGCC GTCGGCGCCG CGGACGCGGA CGGCGCGGTG TCAGTAGGTG A
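
The record reports a GC content of 71% for this gene. The record does not say how that figure was derived, so the sketch below assumes the usual definition: the share of G and C bases in the total sequence length. The function name `gc_percent` is hypothetical. It ignores the spaces and line breaks used to group the sequence into blocks of ten.

```python
def gc_percent(sequence: str) -> float:
    """Return the percentage of G and C bases in a DNA sequence.

    Whitespace (spaces, newlines) is removed first, so the sequence can be
    pasted in the 10-base block layout used on this page.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = bases.count("G") + bases.count("C")
    return 100.0 * gc_count / len(bases)


# Small illustrative input (not taken from the record): 2 of 4 bases are G or C.
print(gc_percent("AT GC"))  # 50.0
```

Passing the full gene sequence above to `gc_percent` should give a value that can be compared with the reported 71%.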
 
Protein sequence
MCTTRGWTPA ALALCSALAV GLSAAPAAAD VSDLRLEQWG LDMVGAAEVW EEAQGSGVTV 
AVLDTGVVTD HPDLKDVTVG PDFTGHDLSS DSDGYGIHGT MMAGIVAASG HGVEHTGGVM
GVAPEAEILA IRITAEPDGP DRDSVDPGAL AAGIRYAVDE GAQVICLPLA GAEFSVQANE
ADQEAIAYAV NHGVVVVASG GPLGEASYPA AYPGVLAVGS VGPDGSLSEF SSRGDHIAVT
APGEEITVVD PDGGYTTVSG SDAAAAFVAG VAALIRGEFP QLKPEQVVEA ITSGAQAADP
AAAGQPGYGA GVVNAPDAFT TAKSTADHVP PFDPELAEQL EEEPLIPYWM LWTGGAVLLI
VAAVVAMVVA HRRAADPYGF GKRKPEEPEE PEPVPTARRP VRGRRRRGRG RRGVSR
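
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The following is a minimal sketch, not the tool used to generate this page. It assumes the standard codon assignments, which translation table 11 shares for elongation codons, and it stops at the first stop codon. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third base
# varies fastest). '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon.

    Whitespace is stripped so sequences can be pasted in 10-base blocks.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence shown above (60 bases, 20 codons).
first_line = (
    "ATGTGCACGA CACGGGGTTG GACTCCCGCC GCGCTGGCTC TCTGCTCGGC GCTCGCTGTC"
)
print(translate(first_line))  # MCTTRGWTPAALALCSALAV
```

The output matches the first two blocks of the protein sequence (MCTTRGWTPA ALALCSALAV). Translating the full gene sequence in the same way should yield the complete protein.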