Gene Adeh_1844 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAdeh_1844 
Symbol 
ID3887408 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaeromyxobacter dehalogenans 2CP-C 
KingdomBacteria 
Replicon accessionNC_007760 
Strand
Start bp2104545 
End bp2106383 
Gene Length1839 bp 
Protein Length612 aa 
Translation table11 
GC content70% 
IMG OID637863382 
Productpeptidase U35, phage prohead HK97 
Protein accessionYP_465053 
Protein GI86158268 
COG category 
COG ID 
TIGRFAM ID[TIGR01543] phage prohead protease, HK97 family 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGACCC GCACCATGTC CGTCGACCTG GGCGAACGCG AGCGCAAGGA AGCCGCCTCG 
TCGCGCCGGT TCCCCGTATC CGTCTCCAGC GAGACCCCGG TCCAACGCGT GGACTGGCAG
AGCGGCGGTC AGCTGTTCGA CGAGGTCCTC AGCCACGCGC GCGGCGCGGT CGACCTCTCG
CGCGCCCCGC TGCCCGTTCT CGAGTCGCAC GACCGCTCCA AGGTGAACGT CGGTGTCGTT
CGCGACCTCA AGCTCGACGG GAAGCGCCTT CGCGGCGAGC TGGTCCTCGG GCAGAGCGAG
CGCGCCAAGG AGCTGGCCGC GGACATCGCC GACGGAATCG TGACCGGCAT CAGCGTCGGA
TACACGATCA GTCAAGAGAC GCGCGACGAG AAGGCGAAGC GGATCACCGC GACGCGCTGG
TGTCCCTACG AAGTCTCGAT CGTTTCCGTT CCCGCCGATC CGAGCGTCGG CATCAACAGG
AGTGCAAGCA TGGACGAGCA GAGCGCCACC ACCACCCCGC CCACCGACAA CACCACGCCT
CCCGAGGTCC TCGCGGAGCG CGAGCGCGCG AGCGAGATCC ATGCGCTCGT GACCCGCGCG
CGGCTCGGCG CCGACTTCGG CGCACAGCTC GTGCGCGACG GCGTCCCGCT GGAGCAGGCC
CGCGCCCGCA TCCTGGATGC GCTCGTCGTT CGGGACCAGC AGGCGCCGAC CAACCAGCAC
ACGCGCATGG ACTACGGCCG TTTCGACGCC ATCAGCCCTG GAACCGACTA CGGCGAGGAC
TTCCGCCGTG CGGCCGTCGA TTCGCTGCTC ATCCGGTCCG GCATCCCGGT GGCGAGGCCG
CACGCAGGCG CGCGCGACCT GTCGGGCAGC GTCTACGACC TCGCTCGGCT CTCGCTCTCT
CGACAGGGGC GCACCGGGTC TCGCTTCGGC GAGAGCCGAG GCCCCGAGCT GATCAAGCGA
GCGATGGTCA CCGCAGACTT CCCCGCGATC CTTGCGGGTG CGCTCCACGC GTCCGTGCGG
AGCGGCTACG AGAGCGAGCC CGCGTCGCAC CGGGCCTGGG TTCGCGCCGT GCCCGTCCCC
GACTTCCGCA CGCAGCAGCG GCCGATCCTG GGGTCGGCGC CCTCGCTCGC CCAGGTGGGC
GAGCACGGTG AGTACACCGA CGGCTACTTC ACCGACGAGA TGGCGTCGTA CTCGGTGATG
AAGTACGGCC GCATGGTCGC GATCTCCTGG GAAGCCCTGG TCAACGACAA CCTGGGCGCC
TTCCTGCGCG TGCAGCCGGC GCTCGGGCAG GCGGCGCGAC GCGCCGAGGC CGACACGGTG
TACGCGCTGT TCGCGCTCAA CAGCGCCGCT GGCCCGACGA TGCAGGACGG CACGGCCCTG
TTCCACGCGA ACCACGCGAA CCTCGCCACC TCTGCTGCGT TCGACTCCGC GCAGCTCGGG
GCCGGCCGCG CGCTGCTGCG CAAGCAGCAG GCTCTCGGGG GCGGGTACCT CTCGCTCGTG
CCCCGCTACC TCGTCGTCCC CAGCGAGCGC GAGACGGCCG CCGAGGTCCT GCTGGCCAAC
GCCACCCGGC GCGTGAACAC GGAGAAGACC ACGCCCGAGT GGATCGCGTC GCTCGAACTC
GTGGTCGAGC CGCGCCTCGC CAACACGGCC GTCTACCTCG CCGCCGAGTA CAACCAGATC
GACACCGCCG AACTCGGGCT GCTCGAGGAG AACATGAACG GGCCGGTGGT CGAGGAGGAG
GCGGAGTTCC GCAAGGACGT GAAGCAGTGG AAGGTGCGCC ACGTCTTCGG GGCCAAGTTC
CTCGACTGGC GCGGCGTCGT GAAGATGCCG GTGACCTGA
 
Protein sequence
MQTRTMSVDL GERERKEAAS SRRFPVSVSS ETPVQRVDWQ SGGQLFDEVL SHARGAVDLS 
RAPLPVLESH DRSKVNVGVV RDLKLDGKRL RGELVLGQSE RAKELAADIA DGIVTGISVG
YTISQETRDE KAKRITATRW CPYEVSIVSV PADPSVGINR SASMDEQSAT TTPPTDNTTP
PEVLAERERA SEIHALVTRA RLGADFGAQL VRDGVPLEQA RARILDALVV RDQQAPTNQH
TRMDYGRFDA ISPGTDYGED FRRAAVDSLL IRSGIPVARP HAGARDLSGS VYDLARLSLS
RQGRTGSRFG ESRGPELIKR AMVTADFPAI LAGALHASVR SGYESEPASH RAWVRAVPVP
DFRTQQRPIL GSAPSLAQVG EHGEYTDGYF TDEMASYSVM KYGRMVAISW EALVNDNLGA
FLRVQPALGQ AARRAEADTV YALFALNSAA GPTMQDGTAL FHANHANLAT SAAFDSAQLG
AGRALLRKQQ ALGGGYLSLV PRYLVVPSER ETAAEVLLAN ATRRVNTEKT TPEWIASLEL
VVEPRLANTA VYLAAEYNQI DTAELGLLEE NMNGPVVEEE AEFRKDVKQW KVRHVFGAKF
LDWRGVVKMP VT