Gene Haur_4473 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4473 
Symbol 
ID5736324 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp5718184 
End bp5720076 
Gene Length1893 bp 
Protein Length630 aa 
Translation table11 
GC content53% 
IMG OID641281636 
Productpeptidase M23B 
Protein accessionYP_001547233 
Protein GI159900986 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0739] Membrane proteins related to metalloendopeptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAACGTT TTAACCCCAA ACGCTATTTC GATTTACGCA ATCGGCTGCA AAGCGATCAA 
CAGCGCCTCT GGGCTTTGTT CAGCACTTTA ATCATCAGTT TGTTGTTAAC GACGCTGGTG
CTACGGATTG CCCCACCTGC CACTGAAACT GGGCGCTTGA TTTTTCGGCC TGTTGATTCG
GTGACGGAGG ACGACCGCAC CCAAGCCGAA CAACCACCAC CAACACTGCC CCCAGCGATC
GAACCGCAAA ATCTGGTTGA TGCAAGCGCG ATCTTGCCCC GTTCTGCCGA AGCGCCAACA
ATCATCGATG ATATTCGCTT TACATCCGAG CAAAATCTGA CTGTCGCCAA TATTCAAACG
TTGTTAGATG CACAGCCTGG CACGCTCAAA GCCGCCTTGG TAACAGTTGG CGATCGCAAT
TTATCGCTGG CCGAGGTGGT GGTTGGCCAA GCCTATTTGT ATAGTCTTAA TCCCAAATTG
CTGCTGGCCT TGCTCGAATT TCAACAAGGC CTGCTGACCA ACCCAACCCC CAACCCTGAT
CAACTCGATT GGGCCATGAA ATATCAGGGC GAGGATGAAA AATGGCGCGG CTTGCATGGC
CAAATTCGTT GGGCCGCCCG TGAATTGCGG CGGGGTGTGC GTGATTTCGC CTATGTTACC
GAGTTGCAAT ATCGCGATAA AGATGTCAAA GGCCCAATTC CAGCTGGTTT AAACCCAAGC
TCGTATGCAG TGATACGGGT GCTGGCTCAA ACCATGACTC CCGAAGAATT GGCGAAAGTG
CTCAGCGATG GCAGTTTTGT CGCAACCTAC AGCAAGTTTT TCGAAGATCC CCGTCAAACG
TTGGGCCAAG TACCAGCGCC AGCAACGCCA TTTTTACGTT GGCCGCTACG CAATGTCACC
TATATCACTT CGTTTTTTGA TCACGAATAT CCCTTTCTAA CGCCCAATCA ATCCTTGGTG
AGTTGGTGGG GGCGACGTGA AACCGAGCTT TCCTATGATG GTCACGATGG CTGGGATTAT
GGCGCACGAC CGCCCGAAGC AGTGGTTGCC GCCGCTGATG GCACGGTGGT TTGGGCCAGC
AATTCTGATG ATGGTTGTGG TGTGCCAGCC AAAGGCGTGG TGCTTGATCA TGGCAATGGC
TATCGCACGC TCTATTGGCA TCTGAGCGAA ATTTCGGTCG AGCTTGGCCA ATCGATCAAA
GGCGGCGAAC AATTGGGCAT CGTTGGCTCA ACTGGCTGTG CGATCGGCCC ACACTTGCAC
TTTCAAACCC AATACCTTGG CCGCAACACC GATCCGTATG GTTGGTGCTC AAGCGAACCC
GACCCATGGA GCAGCTATCC AGTTGGCACA GCTTCGCGCT GGCTTTGGGC CGATCGCCCG
AATCCTTGCG ATCTTGGGCA AACCATCGCA GTGCGGCCAA GCGATCAAGG ATTTAGTCGC
AGCGAAGGCA ATTGGCAAAA TGCCCCAATC GGTGCTGGTG GCGAAACCCT TTGGATTACC
TCGCAAATTC CCATAACAAC CACTGAAACC CTAACCGACA CAATGTCGGA TTTGGCAGGC
GTTGCCACGC CTCAACCAAC GCCAAGCCAA CCACCAAGCA CCGCTACCTG GCAAACCAGC
ATTCCCAGCG CTGGGCGTTA TCGTGTGCTA ACGTATATTC CCTACTACTA CAACGGCCAC
GATGATGCTG TTGCCGCCCA TTATGTGATT GAACACGCCG AAGGTCGCAG CGATGTGGTA
GTCAATCAGT TTGTGTATGC CAACGAATGG GCTGATCTTG GCACCTACAC CTTCGACCCT
AGCAAACCGG CCAAGGTCGA GCTAAGCAAC GAAACCAGCA TGGCCGACCA AGGGATCTGG
GTTGGCACAA CCGTTTGGCT GCCTGCCGAT TGA
 
Protein sequence
MQRFNPKRYF DLRNRLQSDQ QRLWALFSTL IISLLLTTLV LRIAPPATET GRLIFRPVDS 
VTEDDRTQAE QPPPTLPPAI EPQNLVDASA ILPRSAEAPT IIDDIRFTSE QNLTVANIQT
LLDAQPGTLK AALVTVGDRN LSLAEVVVGQ AYLYSLNPKL LLALLEFQQG LLTNPTPNPD
QLDWAMKYQG EDEKWRGLHG QIRWAARELR RGVRDFAYVT ELQYRDKDVK GPIPAGLNPS
SYAVIRVLAQ TMTPEELAKV LSDGSFVATY SKFFEDPRQT LGQVPAPATP FLRWPLRNVT
YITSFFDHEY PFLTPNQSLV SWWGRRETEL SYDGHDGWDY GARPPEAVVA AADGTVVWAS
NSDDGCGVPA KGVVLDHGNG YRTLYWHLSE ISVELGQSIK GGEQLGIVGS TGCAIGPHLH
FQTQYLGRNT DPYGWCSSEP DPWSSYPVGT ASRWLWADRP NPCDLGQTIA VRPSDQGFSR
SEGNWQNAPI GAGGETLWIT SQIPITTTET LTDTMSDLAG VATPQPTPSQ PPSTATWQTS
IPSAGRYRVL TYIPYYYNGH DDAVAAHYVI EHAEGRSDVV VNQFVYANEW ADLGTYTFDP
SKPAKVELSN ETSMADQGIW VGTTVWLPAD