Gene Haur_2774 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_2774 
Symbol 
ID5734655 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp3531480 
End bp3533195 
Gene Length1716 bp 
Protein Length571 aa 
Translation table11 
GC content51% 
IMG OID641279917 
Productpeptidase M23B 
Protein accessionYP_001545540 
Protein GI159899293 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0739] Membrane proteins related to metalloendopeptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCCGCC ATTCACTCAG CCCTAATCGA TCATTGCAAC GTGCTTGGCG TTTTGATCGT 
GGATTACCTA AGCCATTGTT ATTGCTGTTA TTGCTTGATT TATGTCTGCT TTTGCCAGTG
CCATGGCTCT TGCCAAGCGT GTTTGATCCA CAGCCACATA GCCAACCTAG CACAAACCTA
GCAATCGAAG TTCCAAGCGC AACTGCAATA CCCCAAGGTG CAGGCATTGG CAGCGCAGCC
GAACAACGCC AATTGCCATT GTTAATTGAT CCAGTGCGGT TGCAAGATGC TCGCTATGGT
GATGCCCAAA CTATTGCTAG GTTGCTGGCT AAATATTGGC CCAGCTTGGC TAAGCAAACC
TTTGATGTGG GCAATGGCAT CCAACAAGAT GCTGCCAATA TCATCACTGG CCAAGCCTTG
CGCTGGAATA TTAATCCACT CATTTTAGTG AGTTTGTTGG CAATTAATTA CGATGTTGCC
AATTCGCCAC GGTTTGAACA GGCGTTTGGC GGCCCTGAAA TGGGCTTTCC CCAACAAGTA
CTCTGGGCAA CAATCGAGCT GCGCGAAGGC TTAGACCAAC CAATCACCAA CACATTTCAA
CTCCAGCATG GCGGTTGGTG GTCAACTGAA CAACCATTAG ATCGAGCGAA TGCCAGTTTG
TTGCGAGTAT TGGCGCAAAC CCGTAATCAA ACTGAGCTTG AGCAATTGTT GGGTGCTGGT
AACCAGTCAT GGGTCGCGCA GGTAACCCCA ATTATGGGTG ATCCGCGTCA GCCAATCTAC
AATCCAGTCA TCGATCAGCC GTTTATGGTC ACGCCGTTTA ATGCTGCCAA TCATCCAATT
GCCCAATTCG ATCATCAGTA TCCTTTAATT AACGCCGATG GCACGATGCT TGGCAATGGC
TTGAGTTTCG AGTTAGGCTA CGATGGGCAT AATGGCTGGG ATTACGCTCT ACCTGCCGAT
AGTCCGATTT TGGCTGTGGC AGCAGGCACA GTGCTATTTG CAGGCTGGGT TGATAGTGGT
TGCGCTACAC CAGCAGGCGT GGTAGTGGTT CAACATCTCA ATGGTTATCG CAGCGCATAT
TGGCATTTGG GGCGGGTTGA TGTTCAAGCA GGTCAGCAGG TGGCTCAATC GGAGTTACTG
GGGCTGATTG GCCAAACTGG TTGTAGCGTT AATGATCATC TTCATTTGAG CATTCAGCGC
TTAGGTCGCG ATGTTAACCC TGCTGGCTGG TGTAGCACCT TGCCCGATCC ATGGGCTGCC
CATCCAGCCG GAGCCAGCAG CCGTTGGCTA TGGCTCGATC AGGTCGATAG TTGCAACCAG
CCAGCCATGA GCAGTTTGGC CGATGATAGC GATCCAGCCA CGACCACGCG CTATGGCAGT
GGCTGGCAAA TAGATTCAGC AGGCAATTTC AATGGCTCAC ACTGGACAAG CACTGGTGGT
CAAACCCTCT GGCGGCCATG GATCGCCCAA GCGGGTCGTT ATCGTGTAAT GGTCTTTATT
CCCAATGTGG CGACCAAGGC TGGCACGGCC CATTATCGGA TTGCCCATAG CGATGGCATG
AGCGAAATAG TGATCGAGCA GGCCAAGCAT GCTGGCAGTT GGCTCAGTTT GGGCGATTTT
TGGTTTGATC CAGGCCAAAT TGGCCGCATT GGGCTTAGCG CCATACCAGG CACGTTAACA
TGGGCCGATG CAATTGCCGT TCAATCGTTA AATTAA
 
Protein sequence
MARHSLSPNR SLQRAWRFDR GLPKPLLLLL LLDLCLLLPV PWLLPSVFDP QPHSQPSTNL 
AIEVPSATAI PQGAGIGSAA EQRQLPLLID PVRLQDARYG DAQTIARLLA KYWPSLAKQT
FDVGNGIQQD AANIITGQAL RWNINPLILV SLLAINYDVA NSPRFEQAFG GPEMGFPQQV
LWATIELREG LDQPITNTFQ LQHGGWWSTE QPLDRANASL LRVLAQTRNQ TELEQLLGAG
NQSWVAQVTP IMGDPRQPIY NPVIDQPFMV TPFNAANHPI AQFDHQYPLI NADGTMLGNG
LSFELGYDGH NGWDYALPAD SPILAVAAGT VLFAGWVDSG CATPAGVVVV QHLNGYRSAY
WHLGRVDVQA GQQVAQSELL GLIGQTGCSV NDHLHLSIQR LGRDVNPAGW CSTLPDPWAA
HPAGASSRWL WLDQVDSCNQ PAMSSLADDS DPATTTRYGS GWQIDSAGNF NGSHWTSTGG
QTLWRPWIAQ AGRYRVMVFI PNVATKAGTA HYRIAHSDGM SEIVIEQAKH AGSWLSLGDF
WFDPGQIGRI GLSAIPGTLT WADAIAVQSL N