Gene Haur_3645 details


Gene Information

Locus tag: Haur_3645
Symbol:
ID: 5735506
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 4583875
End bp: 4585473
Gene Length: 1599 bp
Protein Length: 532 aa
Translation table: 11
GC content: 53%
IMG OID: 641280794
Product: thermolysin
Protein accession: YP_001546409
Protein GI: 159900162
COG category: [E] Amino acid transport and metabolism
COG ID: [COG3227] Zinc metalloprotease (elastase)
TIGRFAM ID:
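
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the gene length includes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4583875
end_bp = 4585473

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS includes one terminal stop codon that is not counted as a residue.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1599
print(protein_length)  # 532
```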


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0278352
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTTCGCA AACGTTTAAT CACTGTGGCT GCATTGCTCG GTTTAGTTGG GGCTGTCTTT 
GGTTCTGCTG TTAGCACTGG TGCTCAAGAT TACGACAAAG TTCGCGGGCA GTTGATCCGC
ACTTATCGCG AAGCCGAAGC CAAAGGCGTG CGCCCAGATT GGGTCACCGC AGCGGTTGAT
ACCAGCTTGA ATCACTTCGC TGGCAAGGGT GCCAATAGCG CCAATTTGCA AGTGCGTGGC
GTTGATCAAG ATGATCTGGG CATCAACGTG CGCCTTGACC AAACCTATGC TGGCTTGCCA
GTCTTTGGCG GCCAAGTCAT TGCGCACCTC GACAATAAAG GCAATGTGAC CCAAGAAAGT
GGTGAACTCT TCGCGGTTGA TGGGATTGAT ACCAGCGCTA GCTTAAGCTC GGCCGAGGCG
ATCAAAATTG CTCAAAGCCA AGTCAAGTAT GATTTCAACG CCAAAAACGC GAGCGGTACA
GAAGTCAGTA GCGAACTCAA AATCTTGCCA CGCGAAGGCA AAGATTCGGT GATCGTCTTC
CAAGTAAGCT TGCACATTGA AGATGGCAGC GAAGCAACCG CTCACCACGA GTTCTTCATC
AACGCCAAAA CTGGCGAAAC TGAGTTGTAC TACAACGACA TGGATGGAGT CAACGCGACT
GGGACTGGCA AGAGCTTATA CAGCGGCAAT GTCTCAATTA CCACCGACTT GGTGAGTGGG
GTTTACTATC TGCGCGATAA CTCACGCGGC GGGATGTACA CCACCAACAT GAACAACCGC
ACCACTGGCG GTAGCACCTT CACCGATGCT GATAATGTTT GGGGAACCAA CACGACTGCC
AACGTTCAAA GCGCTGGCGT TGATGCCCAC TATGGCGCTC AATTGACCTG GGACTACTAC
TTGAGCAGCT TTGGCCGCCG TGGCATCGAT GGCAATGGCT TCCGCGTATT GAGCCGCGTG
CACTATGGCA ATCGCTATAA CAACGCCTTC TGGAACGGCT CAAGCATGAC CTATGGTGAT
GGCGATGGCA CAACCTTCCG TCCATTGGTT TCGCTCGATG TTGCAGGCCA CGAAATTACC
CACGGTCTGA CCGAAAAAAC CGCTGGCTTG ATCTATAGCA ACGAATCAGG TGCTGCTAAT
GAATCATTCT CGGATATTTT CGGCACAATG GTCGAATATA GCAGCGGCAC AGGCGATTAT
CTGATTGGCG AAGACATCTA CACCCCAGCC ACCGCTGGCG ATGCGTTGCG CAATATGTCG
AACCCAGCCG CCGAAGGCGA CCCCGACCAC TACAGCAAGC GCTACACTGG CACTGGCGAT
AATGGCGGCG TGCACATCAA CAGTGGGATT CAAAACCAAG TCTTCTATCT GTTGGCTCAA
GGTGGAACCA ACCGCACCTC TGGCTTAGCA GTAACTGGCA TTGGCCGACC AAAAGCCGCT
GCGATCTTCT ATCGTGCCTT GACGGTCTAC TTGACCCCAA GCTCAAACTT CAAGGCCGTG
CGCACCGCAA CCCTCAACGC CGCCCGCGAC CTCTATGGCG CAAGCAGCGC TGAATACAAC
GCAACTGCTC AAGCCTGGAC GGCTTGTGGC GTACAATAA
 
Protein sequence
MVRKRLITVA ALLGLVGAVF GSAVSTGAQD YDKVRGQLIR TYREAEAKGV RPDWVTAAVD 
TSLNHFAGKG ANSANLQVRG VDQDDLGINV RLDQTYAGLP VFGGQVIAHL DNKGNVTQES
GELFAVDGID TSASLSSAEA IKIAQSQVKY DFNAKNASGT EVSSELKILP REGKDSVIVF
QVSLHIEDGS EATAHHEFFI NAKTGETELY YNDMDGVNAT GTGKSLYSGN VSITTDLVSG
VYYLRDNSRG GMYTTNMNNR TTGGSTFTDA DNVWGTNTTA NVQSAGVDAH YGAQLTWDYY
LSSFGRRGID GNGFRVLSRV HYGNRYNNAF WNGSSMTYGD GDGTTFRPLV SLDVAGHEIT
HGLTEKTAGL IYSNESGAAN ESFSDIFGTM VEYSSGTGDY LIGEDIYTPA TAGDALRNMS
NPAAEGDPDH YSKRYTGTGD NGGVHINSGI QNQVFYLLAQ GGTNRTSGLA VTGIGRPKAA
AIFYRALTVY LTPSSNFKAV RTATLNAARD LYGASSAEYN ATAQAWTACG VQ
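
The protein sequence can be reproduced from the gene sequence by codon-by-codon translation. The following is a minimal sketch, assuming the sequence blocks are pasted in with their spacing intact; the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical and not part of any database tool. Translation table 11 assigns the same amino acids to codons as the standard code and differs in its permitted start codons, which does not matter here because this gene begins with ATG.

```python
# Codon assignments in the conventional TCAG ordering (shared by tables 1 and 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to group bases into blocks of ten."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence in this record.
first_blocks = "ATGGTTCGCA AACGTTTAAT CACTGTGGCT"
print(translate(first_blocks))  # MVRKRLITVA
```

Passing the full gene sequence to `translate` should give a string that can be compared against the protein sequence once its spacing is removed with `clean`, and `gc_fraction` on the same input can be compared against the GC content field.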