Gene Haur_5249 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_5249 
Symbol 
ID5737207 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009974 
Strand
Start bp19030 
End bp20289 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content61% 
IMG OID641282413 
Productpeptidase M23B 
Protein accessionYP_001548004 
Protein GI159901759 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0739] Membrane proteins related to metalloendopeptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.973521 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACGAGC GAGCCATTGG ATCAAGCATC CTTGTGGCAA GCCTGATTGT TTTGTTGGCA 
TGTGCGCCGA CCTACGGCGA CGACGACCAG CCGCCAACGA TTGTGGTCAA CTGGCAATGC
CCCACGCCCT CACCGCTGCC AACGATCCAA TCGGGGGTAT TGCCTACGCC GGAACCGCCG
ATCAACGCGA CCTACGTGCC AGGGCAGGAG CCAACTGCCG AAGCGATCTA CACGACCCCG
CTGCCCACGG CGACCCCGTA TGTGCGAACC GGCAGTGATT ACTATGTCAA CCAGCGGATT
CAAATCAATC GCTATACCTT ACGCGTGACC AGCTACCGGA CGCAGCCTGC GAGCAACGGC
AATGCTTACC ATCTGGTGAC GCTGGCCTTA GAAAACCCCA CCCAAACGCA ATGGCCGCTG
TATCTGGACT TCTCGCAGCT GCGGGCGATC AAAGGAACGG ACGGACGCAT GATCCAAGCG
ACGTGGTATC CGAGCGAGGA GGCCGCGCAG CAGTTGGGCA TTAGCCCCGC CAAAGACCCG
CCGCTTGAGG AAGTTGGCGA CACGTTGGTG GGTGGCTATC CCATCGGCAC GAGTAGCCGC
ACGCTGGTGT TTGAAGCGCC CGCAGGCGAC GCGCAGGCGT GGGGAATTAC GTTGACCAAC
GAGGACACCA GCCGCGATGA CGGCGCGGGC AGTGGCCAAG TCTGGGTATT ACTGCGCAGC
GACCCCAACT GTACCACGGG CGTAGGCGGC GGAGCCAGTG AGGGCGGCAC GATTCCGACC
GCAGGCACAC CGAGTACCGG ATCAGGCCGT TGGCCTGTGC CGCTGGATAC GCCAATTACC
CGTGCTTTCG GCTGCCACGG GTTTTTCACG GGGACGCGCG GCCCCTGTGG CGGCGCGACC
CCGTGGTGGC ATGATGGCAT CGACTTCGCC AAGCCCGAAG GCACGCCGCT GTATGCGACC
CGTGACATGA CGGTGCTGTT TGCCGGACGC GACACCTCGA CGATTGATTG TTCGGGGATC
AGCGGGTCGC GTTTCCCCCA TTTCGGCTTT GGAAATTATG TCAAAACCCA AGATAGCCTC
GGTTTTTCGT ACTGGTATGG GCACGTTTCG GCGTGGTCAG TCAGTGCAGG CCAAACCGTG
CGGGCAGGCC AGCAGATCGC GTCGATGGGA TCAACGGGCT GTTCCACCGG ATCACACCTG
CACTTTCGGG TGCGGCTGAA TGGGTTGGAC AGAAATCCTT TAGACGTTAT TTCGAAATAA
 
Protein sequence
MNERAIGSSI LVASLIVLLA CAPTYGDDDQ PPTIVVNWQC PTPSPLPTIQ SGVLPTPEPP 
INATYVPGQE PTAEAIYTTP LPTATPYVRT GSDYYVNQRI QINRYTLRVT SYRTQPASNG
NAYHLVTLAL ENPTQTQWPL YLDFSQLRAI KGTDGRMIQA TWYPSEEAAQ QLGISPAKDP
PLEEVGDTLV GGYPIGTSSR TLVFEAPAGD AQAWGITLTN EDTSRDDGAG SGQVWVLLRS
DPNCTTGVGG GASEGGTIPT AGTPSTGSGR WPVPLDTPIT RAFGCHGFFT GTRGPCGGAT
PWWHDGIDFA KPEGTPLYAT RDMTVLFAGR DTSTIDCSGI SGSRFPHFGF GNYVKTQDSL
GFSYWYGHVS AWSVSAGQTV RAGQQIASMG STGCSTGSHL HFRVRLNGLD RNPLDVISK