Gene Haur_5290 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_5290 
Symbol 
ID5737248 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009974 
Strand
Start bp81342 
End bp83006 
Gene Length1665 bp 
Protein Length554 aa 
Translation table11 
GC content44% 
IMG OID641282454 
Productpeptidase M23B 
Protein accessionYP_001548045 
Protein GI159901800 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0739] Membrane proteins related to metalloendopeptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGGTAC GTAGGCGACA GGTTCGACTC TTGGCTGTTG TACTATTCTT TTGTTTAATG 
AGTATTCTTC CATTAAATGC ATCTGCATTT CAAGCAGAGC TTGTTGATGC TATTAAAATG
GAGGCATATT CCACTTTTCA CCTTGAAGAT CAAAAAGCAA TATTAATTGA AAATATTCGC
TTAGATCTCC CTTGGGCTTC TGGAACGATT GTCCTTCTCG ATCCTAATGT GACTGCTACA
ACTCCTCATA TGGGTATCTT TCTTGGAATG TATGATAAGA AAGAGCAACG TTGGACTATT
GCATTCCGCA CAAATCCGCA ATTTCAGACA TGGCTTCCCG ATGTTCCTCT AACGGTTTTT
AACGAAACAG AAAAGGGTTT CTTTTCTAAC CCACAGCGCT TCGTTGATGG ACTTGCGCTG
TTAAGTCTCC CATGGCCTGC GCATGAGAGT CGCTATATGA GCCAAGGTCC TCATAACTAT
AATGGAGGAA ATGTTAATCC TCTATCATCA CTTGATTTTA CTGGCGGGTC TGGGGGTGTT
TATGCAGCCC GCGAAGGGCT AGTTCAAGTT TTATGTGGCG GTAATAAAGT TTTTATTGAT
CATGGCGATG GATGGCAAAC GGGATACTAT CATCTTACAG CACTTGATCC GCGTATTATC
ACTGGAAGTT TTGTCCAGCG CGGACAATAT CTTGGACAAA TGGGTACAGG GGTTGGATGT
GGAGGATCTG CTACAGGCAA CCATGTGCAT TTTACACTCT ATAAACATAG TATTCCTGAA
GATCTTGATG GAAAGGTGAT CGGTGGATGG ATAATTAACG GGAATTGTTT TATTCGTAAC
GCTGAAACAA TATGTAATGA GGGTTGGGTT GTGTCTGATG GATCAATAGG ATCAGGAACA
TCCAGTCAAT TACAAGCTAC CCCTAATCCT ATCCCCGCAT CAACAATATT GGAGAGCAGC
ACAATTAGAT GGAATACGGG CGGAATTTTT GGACAGGTCT ATGTATCGAA GGATGGAGCG
CCAGAAGTTC TGATGACCCA AGGGGTATCT GGAGTTGATA GCCCATCATG GTTTTCGCCT
GGTTCAGAAT ATCGGTTTCG CCTTTATGGG GGAAGTCAAA AGGCAATACT GTTAAAGGAA
CTTATAGTCC GTCGAACACA GAGTCTCATT GCTACCCCTC AGCGTTTACC CGCCTCGGTT
TCTCTCACTA GTGCCACGAT CTATTGGAGT ACGGGTGATG GTTCATTTGG CCAAGTTTAT
GTGTCCAAAG ATGGAGGGGC AGAGACATTA ATGTCGCAAG GATCATATGG GGTTGATAAC
CCATCATGGT TTTCGCCTGG TTCAGAATAT CGATTCCGTC TCTATAAAGG TGACCAAAAG
GCGATTATTC TTGGGGAACT CGTACTTCGG CGAGAACAAA GTCTTATTGC TACACCAACC
TCTATTCCGA CAGGGATACA AGGGCCAATA ATGATTAATT TGTATTGGAG TACTGGAGAT
GGAACAGTTG GTCAAGTCTA TGTATCGAGA GATGGTGCAC CAGACACGTT GATGTCGCAA
GGCAGCTCTG GCCATGCAAA CCCTAGCTGG ATCATGCCTG GATCACGGTA TCGATTTCGC
TTGTATCGCG GTACTTCATT ATTAAGAGAG ATTATTATCC AATAA
 
Protein sequence
MMVRRRQVRL LAVVLFFCLM SILPLNASAF QAELVDAIKM EAYSTFHLED QKAILIENIR 
LDLPWASGTI VLLDPNVTAT TPHMGIFLGM YDKKEQRWTI AFRTNPQFQT WLPDVPLTVF
NETEKGFFSN PQRFVDGLAL LSLPWPAHES RYMSQGPHNY NGGNVNPLSS LDFTGGSGGV
YAAREGLVQV LCGGNKVFID HGDGWQTGYY HLTALDPRII TGSFVQRGQY LGQMGTGVGC
GGSATGNHVH FTLYKHSIPE DLDGKVIGGW IINGNCFIRN AETICNEGWV VSDGSIGSGT
SSQLQATPNP IPASTILESS TIRWNTGGIF GQVYVSKDGA PEVLMTQGVS GVDSPSWFSP
GSEYRFRLYG GSQKAILLKE LIVRRTQSLI ATPQRLPASV SLTSATIYWS TGDGSFGQVY
VSKDGGAETL MSQGSYGVDN PSWFSPGSEY RFRLYKGDQK AIILGELVLR REQSLIATPT
SIPTGIQGPI MINLYWSTGD GTVGQVYVSR DGAPDTLMSQ GSSGHANPSW IMPGSRYRFR
LYRGTSLLRE IIIQ