Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4473 |
Symbol | |
ID | 5736324 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 5718184 |
End bp | 5720076 |
Gene Length | 1893 bp |
Protein Length | 630 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641281636 |
Product | peptidase M23B |
Protein accession | YP_001547233 |
Protein GI | 159900986 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0739] Membrane proteins related to metalloendopeptidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAACGTT TTAACCCCAA ACGCTATTTC GATTTACGCA ATCGGCTGCA AAGCGATCAA CAGCGCCTCT GGGCTTTGTT CAGCACTTTA ATCATCAGTT TGTTGTTAAC GACGCTGGTG CTACGGATTG CCCCACCTGC CACTGAAACT GGGCGCTTGA TTTTTCGGCC TGTTGATTCG GTGACGGAGG ACGACCGCAC CCAAGCCGAA CAACCACCAC CAACACTGCC CCCAGCGATC GAACCGCAAA ATCTGGTTGA TGCAAGCGCG ATCTTGCCCC GTTCTGCCGA AGCGCCAACA ATCATCGATG ATATTCGCTT TACATCCGAG CAAAATCTGA CTGTCGCCAA TATTCAAACG TTGTTAGATG CACAGCCTGG CACGCTCAAA GCCGCCTTGG TAACAGTTGG CGATCGCAAT TTATCGCTGG CCGAGGTGGT GGTTGGCCAA GCCTATTTGT ATAGTCTTAA TCCCAAATTG CTGCTGGCCT TGCTCGAATT TCAACAAGGC CTGCTGACCA ACCCAACCCC CAACCCTGAT CAACTCGATT GGGCCATGAA ATATCAGGGC GAGGATGAAA AATGGCGCGG CTTGCATGGC CAAATTCGTT GGGCCGCCCG TGAATTGCGG CGGGGTGTGC GTGATTTCGC CTATGTTACC GAGTTGCAAT ATCGCGATAA AGATGTCAAA GGCCCAATTC CAGCTGGTTT AAACCCAAGC TCGTATGCAG TGATACGGGT GCTGGCTCAA ACCATGACTC CCGAAGAATT GGCGAAAGTG CTCAGCGATG GCAGTTTTGT CGCAACCTAC AGCAAGTTTT TCGAAGATCC CCGTCAAACG TTGGGCCAAG TACCAGCGCC AGCAACGCCA TTTTTACGTT GGCCGCTACG CAATGTCACC TATATCACTT CGTTTTTTGA TCACGAATAT CCCTTTCTAA CGCCCAATCA ATCCTTGGTG AGTTGGTGGG GGCGACGTGA AACCGAGCTT TCCTATGATG GTCACGATGG CTGGGATTAT GGCGCACGAC CGCCCGAAGC AGTGGTTGCC GCCGCTGATG GCACGGTGGT TTGGGCCAGC AATTCTGATG ATGGTTGTGG TGTGCCAGCC AAAGGCGTGG TGCTTGATCA TGGCAATGGC TATCGCACGC TCTATTGGCA TCTGAGCGAA ATTTCGGTCG AGCTTGGCCA ATCGATCAAA GGCGGCGAAC AATTGGGCAT CGTTGGCTCA ACTGGCTGTG CGATCGGCCC ACACTTGCAC TTTCAAACCC AATACCTTGG CCGCAACACC GATCCGTATG GTTGGTGCTC AAGCGAACCC GACCCATGGA GCAGCTATCC AGTTGGCACA GCTTCGCGCT GGCTTTGGGC CGATCGCCCG AATCCTTGCG ATCTTGGGCA AACCATCGCA GTGCGGCCAA GCGATCAAGG ATTTAGTCGC AGCGAAGGCA ATTGGCAAAA TGCCCCAATC GGTGCTGGTG GCGAAACCCT TTGGATTACC TCGCAAATTC CCATAACAAC CACTGAAACC CTAACCGACA CAATGTCGGA TTTGGCAGGC GTTGCCACGC CTCAACCAAC GCCAAGCCAA CCACCAAGCA CCGCTACCTG GCAAACCAGC ATTCCCAGCG CTGGGCGTTA TCGTGTGCTA ACGTATATTC CCTACTACTA CAACGGCCAC GATGATGCTG TTGCCGCCCA TTATGTGATT GAACACGCCG AAGGTCGCAG CGATGTGGTA GTCAATCAGT TTGTGTATGC CAACGAATGG GCTGATCTTG GCACCTACAC CTTCGACCCT AGCAAACCGG CCAAGGTCGA GCTAAGCAAC GAAACCAGCA TGGCCGACCA AGGGATCTGG GTTGGCACAA CCGTTTGGCT GCCTGCCGAT TGA
|
Protein sequence | MQRFNPKRYF DLRNRLQSDQ QRLWALFSTL IISLLLTTLV LRIAPPATET GRLIFRPVDS VTEDDRTQAE QPPPTLPPAI EPQNLVDASA ILPRSAEAPT IIDDIRFTSE QNLTVANIQT LLDAQPGTLK AALVTVGDRN LSLAEVVVGQ AYLYSLNPKL LLALLEFQQG LLTNPTPNPD QLDWAMKYQG EDEKWRGLHG QIRWAARELR RGVRDFAYVT ELQYRDKDVK GPIPAGLNPS SYAVIRVLAQ TMTPEELAKV LSDGSFVATY SKFFEDPRQT LGQVPAPATP FLRWPLRNVT YITSFFDHEY PFLTPNQSLV SWWGRRETEL SYDGHDGWDY GARPPEAVVA AADGTVVWAS NSDDGCGVPA KGVVLDHGNG YRTLYWHLSE ISVELGQSIK GGEQLGIVGS TGCAIGPHLH FQTQYLGRNT DPYGWCSSEP DPWSSYPVGT ASRWLWADRP NPCDLGQTIA VRPSDQGFSR SEGNWQNAPI GAGGETLWIT SQIPITTTET LTDTMSDLAG VATPQPTPSQ PPSTATWQTS IPSAGRYRVL TYIPYYYNGH DDAVAAHYVI EHAEGRSDVV VNQFVYANEW ADLGTYTFDP SKPAKVELSN ETSMADQGIW VGTTVWLPAD
|
| |