Gene Haur_4178 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4178 
Symbol 
ID5736040 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp5327805 
End bp5329490 
Gene Length1686 bp 
Protein Length561 aa 
Translation table11 
GC content50% 
IMG OID641281333 
Productpeptidase M3A and M3B thimet/oligopeptidase F 
Protein accessionYP_001546938 
Protein GI159900691 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1164] Oligoendopeptidase F 
TIGRFAM ID[TIGR02289] oligoendopeptidase, M3 family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0341575 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTTTGA AGATTGACCC ACGCGATTGG GCCACAGTTC AGCCCTACTA CGATTCACTT 
GCAGCCGAAG AGCTTACGGC TGCCAATGCA ACTGCTTGGT TAGGCCGTTG GAGTGAGTTG
GAAGCCCATT TGCAAGAATT CGGCTTCAAG GCCTATCGGG CGATGACCGA AAATACGCAG
GATCAGGCCG CCGAAGAGCT GTACCTCTAC ACAATTGAGG AGTTGACCCC GAAATCCAAG
GTTGCCGCCC AAACCTTGAA AGAAAAAATT TTGGCGCTTG ATCCAACGCT GTTGCCAGCC
AAAATGCAAG AAATGCTGCG GCGATTTCAT GCTGAAGCCA ATTTATTCCG CGAAGAAAAT
GTGCCACTAT TGACCGAAGT TGAGCGCCTG AGTGCGGAAT TTAGCAAATT GATGGGTGCA
ATGAGCGTTG AGTGGCAAGG CGAAACCCTG ACCATGCAAG CAGTCGTTAA GCTGTTTAGC
GACCCAGATC GCAGTGTGCG CGAAGCTGCA TTTAAGGCTT ATCATAGCCG TTTTTTGCAA
GACCGAGCAG CTTTCAACGA GATTTATCTC AAGCAACTTA AGCTTCGTCG CCAAATTGCC
ACCAATGCAG GTAAAGCCAG CTATTTGGAT TATGTCTGGG ATCTCTATGG GCGCTTCGAT
TATACGCCTG CCGATTGCCG CACCTTCCAC GATGCAATTG AACATGAAGT TGTGCCGTTG
GTTGAAAAAT GGTTGGCAGT CCATCAAGCC GAATTAGGCG TTGATAGCTT GCGACCATGG
GATATTTTGG TTGATTCGCA ATCACGCCCA GCTTTAACTC CATTCCAAAG CGCCGATGAA
CTCGAAAGCG TCTGCCAAAC AATCTTCGAT CGCGTTGATC CTGAACTGGG AGCGCAATAC
ACTCGCATGC GTGATGGCTG GCTTGATTTG GATTCACGGC CTGGTAAAGC ACCTGGTGGT
TATTGTGGCG GGATGTTTGT TTCCAAAGTG CCCTACATTT TTATGAATGC GGTTGGCACC
CATGATGATG TCCAAACCAT GTTGCACGAA GGTGGCCATG CTTTCCACTT TATGGAATCA
TCGGCAACCA ACGATTTGAT TTGGAATTAC GATGGGCCAA CTGAATTTTG TGAAGTCGCT
TCGATGGCGA TGGAATTGTT GGCAGCGCCC TATTTAGCCA ACGCTAAGGG CGGTTTCTAC
AGCGAAACTG ATGCCCGTCG TGCTCGTGCA GAGCATCTTT GGAGTATGCT CAAATTCTTG
CCCTATATGG CGACCGTCGA TGCCTTCCAA CATTGGGTCT ACATTGAAGC GCCTGAAGAT
GTAACGGCTG AGCAGCTTGA TGCCGTGTGG AAAGACCTCT ATCAGCGCTT TATGAGTGGG
GTGAATTGGG ATGGCTTCGA AGATGTACAG ACCACAGGCT GGCATCGCAA GCAGCATATT
TTTGAAGCTC CGCTCTATTA TGTAGAGTAT GGCTTAGCCC AATTGGGGGC GCAACAAGTT
TGGCGCAACG CTTTAGCTGA TCAAGCGCAG GCCGTAGCAC AATATCGCGC AGCCTTAGCC
TTGGGCAATA CCAAACCACT GAGTCAATTA TTTGCAGCGG CTGGGGCAAA ATTTGCCTTC
GATCGGGCAA CTGTCGGCGA ATTAGCCCGC TTAGTCGATC AGCATCTGAC CACACTCCGT
GATTAA
 
Protein sequence
MSLKIDPRDW ATVQPYYDSL AAEELTAANA TAWLGRWSEL EAHLQEFGFK AYRAMTENTQ 
DQAAEELYLY TIEELTPKSK VAAQTLKEKI LALDPTLLPA KMQEMLRRFH AEANLFREEN
VPLLTEVERL SAEFSKLMGA MSVEWQGETL TMQAVVKLFS DPDRSVREAA FKAYHSRFLQ
DRAAFNEIYL KQLKLRRQIA TNAGKASYLD YVWDLYGRFD YTPADCRTFH DAIEHEVVPL
VEKWLAVHQA ELGVDSLRPW DILVDSQSRP ALTPFQSADE LESVCQTIFD RVDPELGAQY
TRMRDGWLDL DSRPGKAPGG YCGGMFVSKV PYIFMNAVGT HDDVQTMLHE GGHAFHFMES
SATNDLIWNY DGPTEFCEVA SMAMELLAAP YLANAKGGFY SETDARRARA EHLWSMLKFL
PYMATVDAFQ HWVYIEAPED VTAEQLDAVW KDLYQRFMSG VNWDGFEDVQ TTGWHRKQHI
FEAPLYYVEY GLAQLGAQQV WRNALADQAQ AVAQYRAALA LGNTKPLSQL FAAAGAKFAF
DRATVGELAR LVDQHLTTLR D