Gene Tcur_4004 details

Gene Information

Locus tag: Tcur_4004
Symbol:
ID: 8605360
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermomonospora curvata DSM 43183
Kingdom: Bacteria
Replicon accession: NC_013510
Strand:
Start bp: 4570538
End bp: 4571737
Gene Length: 1200 bp
Protein Length: 399 aa
Translation table: 11
GC content: 75%
IMG OID:
Product: carboxyl-terminal protease
Protein accession: YP_003301571
Protein GI: 269128201
COG category:
COG ID:
TIGRFAM ID:
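
The coordinate and length fields in this record are mutually consistent, which can be checked with a little arithmetic. The following is a minimal Python sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming exactly one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(4570538, 4571737)
print(length)                  # 1200
print(protein_length(length))  # 399
```

Both values agree with the Gene Length and Protein Length fields listed in the record.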


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTCCGGA TCACCGGTCG TGTCCCCCGA GGCTTTCGCG CCGGTGGATT CACCCGTGGT 
GCGGCATTGA TCACGGTGGT GCTGTGCGTT TACGGCGCCG GCGTGGTGAC CGGCGCCGAG
GCGCCCTCCT CCTCCACCCG GCCCCAGCGG GGGCCGCTGG ATGAGGCCGC CGAACGGATC
GCCGAGGAGT CGGCCCTGCC GGTCGACCGC GCCGAGCTGC AGCGCGCCGC GGTGAACGGG
ATGCTGCAGC GGCTCGGCGA CCGCTGGGCC CGCTACTACA CCGCGACGGA GTACGACGAC
ACCCGTGGCC GGCTGAACGG GCGCTATAGC GGCGTCGGGC TGTGGCTCGG CGTCGAGGAA
GGGTCCGGAC GGGTCCTGGT GGCCAGCGTT CAGCCGGAGT CGGCCGCCGA ACGCGCCGGA
GTCCGGGTCG GCGACGCCAT CACCGGGATC GGCGACCGCA AGGTCGGCGG ATGGACCGTG
AGCAAGGTCG CCGCCGCGCT GCGCGGCGCC CCCGGCACCT CGGTGACGCT CACCGTGCTG
CGGAAGGGCG CCGAGCGCCA CTTCACGCTG GTGCGCTCGG CCGTGCAGAC CGGCGATGTG
ACCGTGGAGC AGCGTTCCGG CAGCATTCGG GTGATCCGGG TGGCGGCGTT CACCCGCGGG
GTGGGCCGGC AGGTGCGCGA GGCCGTCGAG CGGCCGGCCG GCGGCGCGGA GTCCGGGCTG
ATCCTGGATC TGCGCGGCAA CCCGGGCGGG CTGCTGGAGG AGGCGGTGGA GACCTCCTCG
GCGCTGCTGA GCGACGGGGT GGTCGCCGTC TATGAGCGGC GCGGCGAGCG GCCCCGGGAG
CTGCGCGTCA CCGAGCCGGG GGACGGCCGC ACCCCGCTGG TGGTGCTGGT GGACGCCGGA
ACCGCCAGCG CCGCCGAGGT GGTCGCCGGT TCCCTGCGCG ATCGCGACCG CGCCGTCCTC
GTAGGATCCC GTACCTATGG GAAGGGGTCG GTGCAGGAGC CGGTCCGGCT GCAGGACGGC
TCGGTGATCG AACTGACCGT GGGGCGCTAC CGCACCCCCG GTGGCCGTGA CCTGGACGGG
ACCGGGATCG AGCCCGATGT GGCCGTCTCG GCCGACCGCC CCCCCGAGGA GGCCCTGGAA
CGCGCGGGCG CGGTGCTGCG CGGGCTGATG GCCTCCGCGT CCACCAAGGA TCGACGCTAG
 
Protein sequence
MLRITGRVPR GFRAGGFTRG AALITVVLCV YGAGVVTGAE APSSSTRPQR GPLDEAAERI 
AEESALPVDR AELQRAAVNG MLQRLGDRWA RYYTATEYDD TRGRLNGRYS GVGLWLGVEE
GSGRVLVASV QPESAAERAG VRVGDAITGI GDRKVGGWTV SKVAAALRGA PGTSVTLTVL
RKGAERHFTL VRSAVQTGDV TVEQRSGSIR VIRVAAFTRG VGRQVREAVE RPAGGAESGL
ILDLRGNPGG LLEEAVETSS ALLSDGVVAV YERRGERPRE LRVTEPGDGR TPLVVLVDAG
TASAAEVVAG SLRDRDRAVL VGSRTYGKGS VQEPVRLQDG SVIELTVGRY RTPGGRDLDG
TGIEPDVAVS ADRPPEEALE RAGAVLRGLM ASASTKDRR
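
The sequences above are printed in blocks of ten characters, so the whitespace has to be removed before any computation. The following Python sketch shows one way to clean such a listing, compute its GC fraction, and translate it into protein. It assumes that the codon-to-amino-acid assignments of translation table 11 match the standard genetic code (the two tables differ in their permitted start codons). The function names are illustrative and not part of the record.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order ("*" marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(seq_text.split()).upper()


def gc_fraction(seq_text: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq_text)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq_text: str) -> str:
    """Translate codon by codon from the first base, stopping at a stop codon."""
    seq = clean(seq_text)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGCTCCGGA TCACCGGTCG TGTCCCCCGA"
print(translate(first_blocks))               # MLRITGRVPR
print(gc_fraction("ATGCTCCGGA TCACCGGTCG"))  # 0.65
```

The translated fragment matches the first ten residues of the protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` would allow the GC content and Protein Length fields to be checked in the same way.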