Gene Information |
Locus tag | Haur_3645 |
Symbol | |
ID | 5735506 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 4583875 |
End bp | 4585473 |
Gene Length | 1599 bp |
Protein Length | 532 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641280794 |
Product | thermolysin |
Protein accession | YP_001546409 |
Protein GI | 159900162 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG3227] Zinc metalloprotease (elastase) |
TIGRFAM ID | |
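The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the helper names `gene_length` and `protein_length` are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the terminal stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values copied from the record for Haur_3645.
length_bp = gene_length(4583875, 4585473)
print(length_bp)                  # 1599
print(protein_length(length_bp))  # 532
```

Under those assumptions both results agree with the Gene Length (1599 bp) and Protein Length (532 aa) fields listed in the record.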
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0278352 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTTCGCA AACGTTTAAT CACTGTGGCT GCATTGCTCG GTTTAGTTGG GGCTGTCTTT GGTTCTGCTG TTAGCACTGG TGCTCAAGAT TACGACAAAG TTCGCGGGCA GTTGATCCGC ACTTATCGCG AAGCCGAAGC CAAAGGCGTG CGCCCAGATT GGGTCACCGC AGCGGTTGAT ACCAGCTTGA ATCACTTCGC TGGCAAGGGT GCCAATAGCG CCAATTTGCA AGTGCGTGGC GTTGATCAAG ATGATCTGGG CATCAACGTG CGCCTTGACC AAACCTATGC TGGCTTGCCA GTCTTTGGCG GCCAAGTCAT TGCGCACCTC GACAATAAAG GCAATGTGAC CCAAGAAAGT GGTGAACTCT TCGCGGTTGA TGGGATTGAT ACCAGCGCTA GCTTAAGCTC GGCCGAGGCG ATCAAAATTG CTCAAAGCCA AGTCAAGTAT GATTTCAACG CCAAAAACGC GAGCGGTACA GAAGTCAGTA GCGAACTCAA AATCTTGCCA CGCGAAGGCA AAGATTCGGT GATCGTCTTC CAAGTAAGCT TGCACATTGA AGATGGCAGC GAAGCAACCG CTCACCACGA GTTCTTCATC AACGCCAAAA CTGGCGAAAC TGAGTTGTAC TACAACGACA TGGATGGAGT CAACGCGACT GGGACTGGCA AGAGCTTATA CAGCGGCAAT GTCTCAATTA CCACCGACTT GGTGAGTGGG GTTTACTATC TGCGCGATAA CTCACGCGGC GGGATGTACA CCACCAACAT GAACAACCGC ACCACTGGCG GTAGCACCTT CACCGATGCT GATAATGTTT GGGGAACCAA CACGACTGCC AACGTTCAAA GCGCTGGCGT TGATGCCCAC TATGGCGCTC AATTGACCTG GGACTACTAC TTGAGCAGCT TTGGCCGCCG TGGCATCGAT GGCAATGGCT TCCGCGTATT GAGCCGCGTG CACTATGGCA ATCGCTATAA CAACGCCTTC TGGAACGGCT CAAGCATGAC CTATGGTGAT GGCGATGGCA CAACCTTCCG TCCATTGGTT TCGCTCGATG TTGCAGGCCA CGAAATTACC CACGGTCTGA CCGAAAAAAC CGCTGGCTTG ATCTATAGCA ACGAATCAGG TGCTGCTAAT GAATCATTCT CGGATATTTT CGGCACAATG GTCGAATATA GCAGCGGCAC AGGCGATTAT CTGATTGGCG AAGACATCTA CACCCCAGCC ACCGCTGGCG ATGCGTTGCG CAATATGTCG AACCCAGCCG CCGAAGGCGA CCCCGACCAC TACAGCAAGC GCTACACTGG CACTGGCGAT AATGGCGGCG TGCACATCAA CAGTGGGATT CAAAACCAAG TCTTCTATCT GTTGGCTCAA GGTGGAACCA ACCGCACCTC TGGCTTAGCA GTAACTGGCA TTGGCCGACC AAAAGCCGCT GCGATCTTCT ATCGTGCCTT GACGGTCTAC TTGACCCCAA GCTCAAACTT CAAGGCCGTG CGCACCGCAA CCCTCAACGC CGCCCGCGAC CTCTATGGCG CAAGCAGCGC TGAATACAAC GCAACTGCTC AAGCCTGGAC GGCTTGTGGC GTACAATAA
|
Protein sequence | MVRKRLITVA ALLGLVGAVF GSAVSTGAQD YDKVRGQLIR TYREAEAKGV RPDWVTAAVD TSLNHFAGKG ANSANLQVRG VDQDDLGINV RLDQTYAGLP VFGGQVIAHL DNKGNVTQES GELFAVDGID TSASLSSAEA IKIAQSQVKY DFNAKNASGT EVSSELKILP REGKDSVIVF QVSLHIEDGS EATAHHEFFI NAKTGETELY YNDMDGVNAT GTGKSLYSGN VSITTDLVSG VYYLRDNSRG GMYTTNMNNR TTGGSTFTDA DNVWGTNTTA NVQSAGVDAH YGAQLTWDYY LSSFGRRGID GNGFRVLSRV HYGNRYNNAF WNGSSMTYGD GDGTTFRPLV SLDVAGHEIT HGLTEKTAGL IYSNESGAAN ESFSDIFGTM VEYSSGTGDY LIGEDIYTPA TAGDALRNMS NPAAEGDPDH YSKRYTGTGD NGGVHINSGI QNQVFYLLAQ GGTNRTSGLA VTGIGRPKAA AIFYRALTVY LTPSSNFKAV RTATLNAARD LYGASSAEYN ATAQAWTACG VQ
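The sequences above are displayed in space-separated blocks of ten characters, so they need light cleaning before any analysis. The following is a minimal sketch, not a tool used by the database: the function names are hypothetical, and it assumes the nucleotide text contains only A, C, G, and T. Because the record lists the gene on the minus strand, a reverse-complement helper is included in case the sequence has to be compared against the forward strand of the replicon.

```python
def clean_sequence(grouped: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (assumes an unambiguous DNA alphabet)."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# Translation table for complementing unambiguous DNA bases.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Complement each base, then reverse the string."""
    return seq.translate(_COMPLEMENT)[::-1]


# The first two display blocks of the gene sequence, used here as a small sample.
sample = clean_sequence("ATGGTTCGCA AACGTTTAAT")
print(len(sample))
print(gc_fraction(sample))
print(reverse_complement(sample))
```

Pasting the full Gene sequence field into `clean_sequence` and passing the result to `gc_fraction` gives a value that can be compared with the GC content field of the record.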