Gene Haur_4682 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4682 
Symbol 
ID5736529 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp5979222 
End bp5981042 
Gene Length1821 bp 
Protein Length606 aa 
Translation table11 
GC content51% 
IMG OID641281846 
Productpeptidase M3A and M3B thimet/oligopeptidase F 
Protein accessionYP_001547441 
Protein GI159901194 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1164] Oligoendopeptidase F 
TIGRFAM ID[TIGR02290] oligoendopeptidase, pepF/M3 family 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTACCC AAACGCTCCC ACACTGGGAT CTTTCAGTTG TCTACCCCAG CCTAGACTCG 
CCTGAATTTG CCGCAGGCTT TCAACGGATC AAGCAACAAG TGAGCGATCT CAAAAGCCTG
TTTGATCAAG CTGCTATCCA ACGTAGTGAA AACCAAGTGC TTGATTCCGC CACCATCGCG
ACCTTTGAAA CCGTAACTAC TGCCTTCAAC CAAACCCAAG ATGCGATGAT GACCAATTTT
AGCTATGTGA TGGGCTTTGT CGCCACCGAC ACCCGTGATA CGCTAGCCCA AGCTCGGTTT
AGCGAAATGC AACAATTGGG CATGCAATTG CAACAATTGG GTCTGCGCTG GGTCGCTTGG
ATTGGTGGCT TGGATATTGA TCAATTGATT GCTCAATCAA GTTTGGCCGC CGAGCATGCC
TTTATTTTGC ACAAAACCAA GCAAGAAGCG TTGCACCTGA TGTCGCCAGC CGAAGAAGTG
TTGGCAGTCG AACTCAATTT GAGCGGTGGC AGCGCTTGGA GCAAATTGCA CACGGTGGTT
AGCTCACAGC TTAGCGTGCC AGTCACGCAT GCTGGCGAAA CCAAGCATAT GCCAATGAGC
ATGGTACGCA ATTTGGCGTT TGATCCAAAT CGTGAAGTTC GGCGCAGCGC TTACGAGGCC
GAATTGGCTG GCTGGAAAGG GGTCGAAACG CCCATGGCAG CGGCAATCAA CAGCATCAAA
GGCCAAGTTA ACACGTTGGC TGGCCATCGT GGTTGGGCTA CAGCGTTGGA TGAGGCGCTG
GCGGATAACA ATATTGATCG CCAAACCCTT GATGCAATGT TAACCGCCGC CCGCGAAACC
TTCCCAGATT TCCGCCGCTA CTTGCGGGCC AAAGCACGGT TGGTGGGCAG CGAACGGTTG
GCATGGTTCG ATCTCTTCGC GCCGATTGGC AACGATGTCA GCACTTGGGA ATACGCCGAA
TCGGAAGCCT TCATTTTGGA GCATTTCGGC AGCTATTCAC AACGCTTGCG TGATTATGCC
GCCCGCGCCT TCGATGAAAA CTGGATCGAT GCTGAGCCAC GGGCTGGCAA ACGTGATGGC
GCGTTCTGTA TGCGTTTAGT TGGCGATCAA TCACGCATCC TGAGCAACTA TAAGCCCTCG
TTTGCTGGAA TGCGCACCCT TGCACATGAG TTGGGTCACG GCTATCATAA CCTTAACTTA
GCCGAAACCA CCCCATTGCA GCGCCAAATT CCGATGACCT TGGCTGAAAC TGCCAGCATT
TTCTGTGAAA CGATTGTGCG CAACGCCGCC TTGGCTAAGG CTGATGCTGC AACCCAATTA
GCAATCATCG AATCATCGTT GCAAAATAGC TGCCAGTTGG TTGTCGATAT CAGCAGCCGC
TTTATTTTCG AGCAAAGCCT GTTCGAAGCC CGTCAAGCGC GTGAGCTAAG CGTGCAGGAA
ATTAACGACC TAATGCTCAA GGCCCAAGCT GAAACCTATG GCGATGGCTT GGACGAACAG
TACTACCATC CGTATATGTG GGCGGTCAAA GGTCATTACT ATAGCACTGA ACGCTCATTT
TACAATTATC CCTATATGTT TGGCTTATTG TTCGGCTTAG GCTTGTATGC CCAGTATGTA
GCAACACCCG ATGAGTTTCG GGCCAAATAC GATGATTTAT TGGCGGCTTC GGGCAAGCAT
GACGCAGCAA CCTTGGCCGA ACGTTTTGGG ATTGATATTC GGACCCCCGA TTTTTGGCGG
GCAAGTTTGG CCACCATCAA GGCCGATATT GACCGTTTCG TTGAGCTAAG CGAGCAGTTG
TTGGATACCG CTGCCCACTA A
 
Protein sequence
MTTQTLPHWD LSVVYPSLDS PEFAAGFQRI KQQVSDLKSL FDQAAIQRSE NQVLDSATIA 
TFETVTTAFN QTQDAMMTNF SYVMGFVATD TRDTLAQARF SEMQQLGMQL QQLGLRWVAW
IGGLDIDQLI AQSSLAAEHA FILHKTKQEA LHLMSPAEEV LAVELNLSGG SAWSKLHTVV
SSQLSVPVTH AGETKHMPMS MVRNLAFDPN REVRRSAYEA ELAGWKGVET PMAAAINSIK
GQVNTLAGHR GWATALDEAL ADNNIDRQTL DAMLTAARET FPDFRRYLRA KARLVGSERL
AWFDLFAPIG NDVSTWEYAE SEAFILEHFG SYSQRLRDYA ARAFDENWID AEPRAGKRDG
AFCMRLVGDQ SRILSNYKPS FAGMRTLAHE LGHGYHNLNL AETTPLQRQI PMTLAETASI
FCETIVRNAA LAKADAATQL AIIESSLQNS CQLVVDISSR FIFEQSLFEA RQARELSVQE
INDLMLKAQA ETYGDGLDEQ YYHPYMWAVK GHYYSTERSF YNYPYMFGLL FGLGLYAQYV
ATPDEFRAKY DDLLAASGKH DAATLAERFG IDIRTPDFWR ASLATIKADI DRFVELSEQL
LDTAAH