Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1195 |
Symbol | |
ID | 5733088 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 1377217 |
End bp | 1378281 |
Gene Length | 1065 bp |
Protein Length | 354 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641278335 |
Product | metalloendopeptidase glycoprotease family |
Protein accession | YP_001543971 |
Protein GI | 159897724 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0533] Metal-dependent proteases with possible chaperone activity |
TIGRFAM ID | [TIGR00329] metallohydrolase, glycoprotease/Kae1 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCACAGC AAATGCCCGC AGGTTTTACG ATTTTAGCAA TTGAAACATC GTGCGATGAA ACCGCCGCCG CCGTTATTCG CGATGGCCGC GAGATTGTGG CCAATAGCGT CGCTTCACAA ATCGATATTC ATCAACGCTA TGGTGGCGTT GTGCCCGAAG TGGCTTCGCG CCAACATATT CTGACGATTA CACCCGTGAT TAATACGGTG CTGGCCCAAG TGCCCGGCGG TTGGAGCGCG ATTAATGCGG TTGCCACAAC TTATGGACCA GGCTTAGCTG GCGCTTTATT AACTGGAATT AACACCGCCA AGGCGATTGC GTGGTCGCGC AATTTGCCGT TTATCGGGGT TAATCACCTC GAAGGCCATA TTTATGCTTC GTGGCTGCAT ACTGCCAAAG ATCTAGCCTA CCAAGCCCCC GAATTTCCGT GTGTCGCCTT GATTGTTTCT GGTGGCCACA CGGCCTTGGT GTTGCTCAAC GATCATGGTG ATTATCGTTT GCTTGGGCAA ACCCGCGATG ATGCAGTGGG CGAGGCTTTC GATAAGGTAG CGCGAATTAT GGGCTTGGGC TATCCAGGCG GCCCGCAAAT GGAAAAAGTC GCGCGTGGGG TTAATCCAGG TGCACTCAAA TTGCCCCGAG CATGGTTGCG CGGAACCTAC GATTGGAGCT TTAGCGGCCT CAAAACCGCT GTGCTGAACG TGGTCAATGA TCGTTTGGGT GAGCGCATGG AGAAAGTTAA ACTTGCTGAA GTTGACCCAG CCTTCACCGC CCAGCTTGCC GCTGCCTTCC AAGATTCGGC AGTCGATGTG TTGGTGCAAA AAACCGTGGC TGCTGCTCGC GAATACAAAG CCAAAACGAT TATTTTAGCT GGTGGCGTAG CCGCAAATAC TGCCTTACGC GAGCGCTTGA GCGCAACCGC CAAGCCTATT CCGGTAGCCT ACCCGCCAAT TTGGCTGTGT ACCGATAATG CCGCCATGAT CGGCGCTGCC GCCTATTATC GTTATCAAGC AGGCGTGCAG CAGGATTGGT CGCTCGATGC CACTCCCAGT CTCAAGCTGA TTTAA
|
Protein sequence | MSQQMPAGFT ILAIETSCDE TAAAVIRDGR EIVANSVASQ IDIHQRYGGV VPEVASRQHI LTITPVINTV LAQVPGGWSA INAVATTYGP GLAGALLTGI NTAKAIAWSR NLPFIGVNHL EGHIYASWLH TAKDLAYQAP EFPCVALIVS GGHTALVLLN DHGDYRLLGQ TRDDAVGEAF DKVARIMGLG YPGGPQMEKV ARGVNPGALK LPRAWLRGTY DWSFSGLKTA VLNVVNDRLG ERMEKVKLAE VDPAFTAQLA AAFQDSAVDV LVQKTVAAAR EYKAKTIILA GGVAANTALR ERLSATAKPI PVAYPPIWLC TDNAAMIGAA AYYRYQAGVQ QDWSLDATPS LKLI
|
| |