Gene Haur_4751 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4751 
Symbol 
ID5736595 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp6058133 
End bp6059950 
Gene Length1818 bp 
Protein Length605 aa 
Translation table11 
GC content51% 
IMG OID641281916 
Productoligoendopeptidase F 
Protein accessionYP_001547510 
Protein GI159901263 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1164] Oligoendopeptidase F 
TIGRFAM ID[TIGR00181] oligoendopeptidase F 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAACCG CCGCAACAGT GCCAAGCCGT CAGGAAGTAG CCCTCGCCGA TACTTGGGAT 
GCAACCAAAG TTTTCGCTTC TGATGCTGAA TGGGAACAAG CACTTGATGC CTATGCCCAA
TCGATCGCCG ACATCGAGCA ATACCAAGGC CGTTTGGCTG AAGGCCCAGC AACGCTTTTA
GCCGCGCTGA ATTATTCCGA TCAGCTTAAC GAGCGGGTTG GCAAGCTCTA TATTTATGCC
AGCCTGTTTT ATAGCGTCGA TACCACCGAC CAAGTAGCCA AAGCCAAAAT GGATCGGGTG
TTGGGGGTTT ATTCGCGCAC CATGGCTAGC TCGGCGTTTA TCGAGCCAGA AATTATTGCG
ATTGGCTTTG CAACGCTCAA GCAATGGCAA GCCGAAAACG CCGATTTGGC TCGCTATGGT
CAATATTTTC ATGATGTTGA GCGGCGGCAA GCTCATGTGC GTTCAGCCGA AGTTGAACAA
GTATTGGGCT TGGTCAGCGA TGCGTTCGAT TCTGCCAGTT CAACCCATCG CATTTTGACC
GATGCTGATT TGGCCTATCC CGCAGCCCAT AGCACGAATG GGCTTGAATT AGAGCTAACT
CAAGGCACAA TTGGCCTGTT GGTCACCGAC CCTGATCGGG AGGTGCGCCG CACAGCCTTT
GAAAACTACA GCGATGCCCA TCTGGCCAAC AAAAATACCA TGGCCAATTG TTTGGCGACT
TGTGTCAAGC AGCATGTCTT CAATGCTCGC GTGCGCAACT ATAGCTCAGC TTTAGCCGCC
GCCCTTGGCG AAAACAACAT CCCAACCGAT GTCTTTCATC AATTGGTTGC CACGGTGCGC
AAAAACTACC CAACTTGGCA TCGCTACTGG CGCTTACGCC GTCAAGCCTT GGGCTATGAT
CAATTGCATG TCTACGACAT CAAAGCGCCA TTAACCACCA AAAAGCTTGA AATCCCTTAC
ACTGAAGCCG TCGATTGGAT TATTACGGGG ATGCAGCCGT TGGGCGAAGG CTACACCAGC
GTGGCCAAAC GTGGCCTGTT AGAAGATCGC TGGGTTGATA TTTACCCTAA CAAAGGCAAG
AGTGCTGGGG CATTTTCGAG TGGTTGGAAG GGCACGCCAC CTTATATCTT GATGAACTAC
AACAACGATA TTTTTGGGAT GAGCACCTTG GCCCACGAGC TTGGTCACTC GATGCACTCG
TATTTGACCT GGCAAAACCA GCCAACAATT TATAGCAACT ATGGTTTGTT TGTGGCCGAA
GTAGCTTCAA ACTTCAATCA AGCTCTAGTT CGCAACCACC TGTTTAATAC CAACCAAGAC
CCCGATTTCC AAATTGCCTT GATCGAAGAA GCCATGAGCA ATTTCCATCG CTATTTCTTC
ATCATGCCAA TTTTGGCCCA ATTTGAGCTA GAAATCCACG AGCGTGGTCA GCGTGGCGAA
CCATTAACCG CTGCAACCCT CAACGGCATC ATGTTCGATC TGTTCCGTGA AGGCTATGGC
GAGGAAGTTG TGCCCGATGC TGATCGTATG GGGATTACCT GGGCAACCTT CTCGGGCCAT
ATGTATGCCA ACTTCTATGT CTATCAATAT GCCACGGGGA TTGCCGCCGC TCATGCGTTG
GCCGAGGGGG TATTGGCGGG CAAGCCCGAT GCCCAAGCCA ACTATTTGGC CTTCTTGAGT
GCTGGTAGCT CGTTAGCACC ATTGGATGCG CTGAAATTGG CTGGGGTCGA TATGACTTCC
GCCGAGCCAG TTGAAGCCGC CTTCCGCGTG CTGGCCAGCT ATGTTGATCG CCTAGAACAA
TTGCTGGGCA ATCAATAA
 
Protein sequence
MTTAATVPSR QEVALADTWD ATKVFASDAE WEQALDAYAQ SIADIEQYQG RLAEGPATLL 
AALNYSDQLN ERVGKLYIYA SLFYSVDTTD QVAKAKMDRV LGVYSRTMAS SAFIEPEIIA
IGFATLKQWQ AENADLARYG QYFHDVERRQ AHVRSAEVEQ VLGLVSDAFD SASSTHRILT
DADLAYPAAH STNGLELELT QGTIGLLVTD PDREVRRTAF ENYSDAHLAN KNTMANCLAT
CVKQHVFNAR VRNYSSALAA ALGENNIPTD VFHQLVATVR KNYPTWHRYW RLRRQALGYD
QLHVYDIKAP LTTKKLEIPY TEAVDWIITG MQPLGEGYTS VAKRGLLEDR WVDIYPNKGK
SAGAFSSGWK GTPPYILMNY NNDIFGMSTL AHELGHSMHS YLTWQNQPTI YSNYGLFVAE
VASNFNQALV RNHLFNTNQD PDFQIALIEE AMSNFHRYFF IMPILAQFEL EIHERGQRGE
PLTAATLNGI MFDLFREGYG EEVVPDADRM GITWATFSGH MYANFYVYQY ATGIAAAHAL
AEGVLAGKPD AQANYLAFLS AGSSLAPLDA LKLAGVDMTS AEPVEAAFRV LASYVDRLEQ
LLGNQ