Gene Haur_0492 details


Gene Information

Locus tag: Haur_0492
Symbol:
ID: 5732406
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 574177
End bp: 575307
Gene Length: 1131 bp
Protein Length: 376 aa
Translation table: 11
GC content: 52%
IMG OID: 641277618
Product: peptidase M20
Protein accession: YP_001543271
Protein GI: 159897024
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0624] Acetylornithine deacetylase/Succinyl-diaminopimelate desuccinylase and related deacylases
TIGRFAM ID:
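
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function name `expected_lengths` is hypothetical, and it assumes 1-based inclusive coordinates, an unspliced CDS, and a single terminal stop codon that counts toward the gene length but not the protein length.

```python
def expected_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon that
    is included in the gene length but not in the protein length.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the stop codon
    return gene_len, protein_len


# Start bp and End bp as listed in this record
gene_len, protein_len = expected_lengths(574177, 575307)
print(gene_len, protein_len)  # 1131 376
```

Under these assumptions the result agrees with the Gene Length (1131 bp) and Protein Length (376 aa) fields of this record.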


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0471333
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCTCGTC AGCTTTTAAC CTATGTTGAT GCTTGCCTAC CCGATCTGCT GGATGAAATG 
CGTCAATGGA TCGAAATTGA ATCGTTTACC CGTGATATTA CGGCGGTTTC TTGGATGGTC
AACGTGGTTG GCGAGCGTTT GAGTAAGCTT GGTGCAAGTG TGCGCAAATA TAATGGCAAG
CCCCAAGCCG ACCATTTGTT GGCAAGTTGG CCAGGCGAGG GCGAACCATT GCTAATTGTG
GGCCATGTTG ATACCGTTTA TCCGCCAGGC ACGATTGATC AATTTCCGTT CCGCATCGAT
GGCGATGTGG TGCGTGGGCC TGGAGTCAGC GATATGAAGG GCTGTATTTT GCTGACTTGC
GCCGCCTTGC AAGCCTTACG CCACTTTAGC CGCTGGACCA GCCGCCCCTT GAAATTTTTA
ATTACGACCG ATGAAGAGAT TGGTAGCCCA ACCTCCCGAC GGTATATTGA AGAACAGGCT
CGCGGTTGTC GCGCAGCCTT GATTATCGAA TCAGCAGAAG AGGGTGGTTG GCTCAAAACA
TGGCGCAAAA GTGTCAGTAT GTATGACTTA ACAATTACTG GCAAGCCCTC GCATGCGGGG
GTAGCCCCGG AGCTTGGCAT TAGCGCGATT CACGAATTAA GCTACCAAAT TGGCCAGATT
TTGCCCTTGG CGCGGCCTGA AATTGGCACA ACGATCAATA TTGGCAAAAT TAATGGTGGT
ACTGCCACCA ATGTTGTAGC CGCCGAGGCC CATTGCACGA TCGATGTGCG GGCATTAAAA
GTTGGCGAGG CGGAACGGGT TGATCAAGCG CTGCATCAAT TAGTGCCCCA TTTGGCTGGC
GCAAAATTAA CTTTAGAAGG TGGCGTAAAT CGCCCAGCCA TGGAACAAAC GCCTGCCACA
ATGGCATTAT ATGCTGCTGC CGAGCAAATT GCCAATCAAT TGGATTTGCC GATTAAAGCT
AGTGGCACTG GCGGCGGTTC GGATGGCAAT TTCACGTCGG CGATCGGTGT GCCAACCCTC
GATGGGCTTG GTGGCTGGGG CAGTGATTCG CATAGCTTCG ATGAATGGCT TTCGATCAGC
CAATTTGCCC CACGGGCTGC CTTGCTGGCT CGTTTGATTG AGACATTGTA G
 
Protein sequence
MPRQLLTYVD ACLPDLLDEM RQWIEIESFT RDITAVSWMV NVVGERLSKL GASVRKYNGK 
PQADHLLASW PGEGEPLLIV GHVDTVYPPG TIDQFPFRID GDVVRGPGVS DMKGCILLTC
AALQALRHFS RWTSRPLKFL ITTDEEIGSP TSRRYIEEQA RGCRAALIIE SAEEGGWLKT
WRKSVSMYDL TITGKPSHAG VAPELGISAI HELSYQIGQI LPLARPEIGT TINIGKINGG
TATNVVAAEA HCTIDVRALK VGEAERVDQA LHQLVPHLAG AKLTLEGGVN RPAMEQTPAT
MALYAAAEQI ANQLDLPIKA SGTGGGSDGN FTSAIGVPTL DGLGGWGSDS HSFDEWLSIS
QFAPRAALLA RLIETL