Gene DET1037 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDET1037 
Symbol 
ID3229670 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDehalococcoides ethenogenes 195 
KingdomBacteria 
Replicon accessionNC_002936 
Strand
Start bp940964 
End bp942085 
Gene Length1122 bp 
Protein Length373 aa 
Translation table11 
GC content51% 
IMG OID637120601 
Productserine protease 
Protein accessionYP_181753 
Protein GI57234203 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0643787 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAGA AACAAAAATT ATTAAGTCTG TTTCTGGGCA TAGTACTTGT TGTTAGTGTT 
CTCAGCGGAG GCTGTGATTA CTTATCCCAG CCTATAAACT CTGACAATAC CGACAGCACT
TCTACCCCTA TTGATGCAGA CTGGACATTC CCCACTCCCC AGCAGAATCT GCCGGAACTG
GCCAACTATG CCATGGTGGT TGCCATGGTA AAACCAGCCG TGGTAGCGGT AGATGTGGAA
TACATAACCC AGGATATATT CGGCCGCCAA ACGGTTGCCG TAGCCTCAGG TTCGGGTTTC
ATAATAGACC CCAGCGGCTA TATTATTACC AACAACCACG TAGTTGAAGG CGGAAGCACT
GTCACCGTCA CCCTTTCAGA CGGCCGTACC TTTACCGCCA GCCAGGTGGT AACAGATTCA
CGCACAGACC TGGCGGTAAT CAAGGTGGAT ACACTGGGTG AAGACCTGCC GTTTGTATAT
ATAGGTGATT CGTCAGCTTT GGAAGTAGGC GAACCGGTGG CGGCTATCGG CAATGCATTG
GGGCTGGGGA TAACCATGAA AGGCGGCTGG ATAAGCCGTC TGGATGCCCA GATAACCGTT
GACCAGAGTG TAACCCTGTA CGGTTTGATA GGTACAGATG TAGCCATAAA CGAAGGCAAT
TCCGGCGGCC CGCTGGTAAA TATGGCCGGT GAGGTTATCG GCATTACCTC TGCCAAAATA
GCGGAAGTGG GGGTGGAAGG GGTAGGCTAC GCTATAAATA TAAACTCCGC CCGCACCTTC
ATTGAAGAGC TGGTCAAAAA AGGCTATATT ACCCGGCCTT TTATGGGAGT GGCCGGCATA
CTGACCGTAG ACAGTTCAAT CCAGTCATAC TTCAGGCTGG GCATAGACAG AGGGGTGCTT
ATCCGGGGCG TGTCTGAAGG CGGACCCGCC GAAAAAGCAG GTCTAATGGC AAATGATGTT
ATTCTGGCCA TAAACGGCCA GCCAGTGCTG ACTGATGAAG AACTGATACT AGCTATCCAC
GGCAAAAAGA TAGGCGATAA AATAGAGGTC AGCTATTTCC GGGACGGAGT AACCGCTACT
GTCACTCTGA CACTGGCAGA GACCCCGCCG CCGGAAAGCT AG
 
Protein sequence
MKKKQKLLSL FLGIVLVVSV LSGGCDYLSQ PINSDNTDST STPIDADWTF PTPQQNLPEL 
ANYAMVVAMV KPAVVAVDVE YITQDIFGRQ TVAVASGSGF IIDPSGYIIT NNHVVEGGST
VTVTLSDGRT FTASQVVTDS RTDLAVIKVD TLGEDLPFVY IGDSSALEVG EPVAAIGNAL
GLGITMKGGW ISRLDAQITV DQSVTLYGLI GTDVAINEGN SGGPLVNMAG EVIGITSAKI
AEVGVEGVGY AININSARTF IEELVKKGYI TRPFMGVAGI LTVDSSIQSY FRLGIDRGVL
IRGVSEGGPA EKAGLMANDV ILAINGQPVL TDEELILAIH GKKIGDKIEV SYFRDGVTAT
VTLTLAETPP PES