Gene Haur_4949 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4949 
Symbol 
ID5736785 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp6276438 
End bp6277535 
Gene Length1098 bp 
Protein Length365 aa 
Translation table11 
GC content52% 
IMG OID641282116 
Productpeptidase M20 
Protein accessionYP_001547707 
Protein GI159901460 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0624] Acetylornithine deacetylase/Succinyl-diaminopimelate desuccinylase and related deacylases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTCAATCC GTACAAGTTT GGTTGATCTG ACCACGCGCT TGGTGGCGAT TCCCAGTGTT 
TCCGCCGAAA AACGCGATTT GCAGCCAGTG ATCGATCTGG TGGTGGCTGA ATTAGCCGAT
TATCCAGCGG CGTTGTTGCA TCATCGCGAT GCTAATGGCT ACCCAATGTT GGTGGTCAAT
TTCAACCAAG AACTGCGCAG CGATCTTATT TTGAATGCCC ATTTGGATGT TGTGCCAGCC
CGCCCTGAGC AATGGCACGC CTTCGAGCAT GATGGCAAAT TGTATGGTCG TGGCACGCAA
GATATGAAGG GATCGGCGGC GGTCTACATT GAAATTATTA AAGAAATTGC CCAATTGCCT
GCTGAGCAAC GCCCTAACGT AAGCTTTCAA TTTGTGACCG ATGAGGAAAT TGGCGGAGCA
AATGGCACAG CCTTATTGCG TGATGAAGGC TGGCAGGCTA ATTTATTTAT TGCTGGCGAG
CCGACCAACC TGAATATTTG TCATGGAGCC AAGGGCATTT TATGGCTGGC AGTTGAGCAA
CCAGGCGTGC CAGCCCATGG TTCGCGGCCT TGGGAAGGCG TGAATCCGAT TGAGCGTTTG
GCAAGTGGCC TTGGGCGTTT GTACGAATAT TATCCAACGC CTGCGCAAGA AATTTGGCGC
ACTACGGTTA CGCCTTCGAT TATCAAAGGC GGCGATGCTG GCAATCGGAT TCCAGCCAAT
GCCCAACTGA ATCTTGATAT TCGCTGGACA CCCGAAGAAG GTGCTGATGC GGTGATTGAT
AACGTGAAGC AAGCCTTTGC AACGAGCAGC GAACCCAATC CCAATGTGCA GATTTTGCAT
CGTGGCACGG CCCTAAATAC GCCAGCCGAG GAGCCAAACT TACAACGCAT TGTTGATGCA
CAACAATCCA GCCTTGGTCG CCAAGCCCAA CTCTTCCGCG AGCATTTTGG CTCCGATGCC
CGCTTCTACA GCGATGCCGG AATTCCAGCG GTCTGTTGGG GGCCAGAAGG TGCAGGCTTG
CATACCGACG ACGAGTGGGT CAGCATCGAT GGCTTGGTCG ATTATTATCA GGCGGTCAAA
ACCTTGTTGG GTATGTAG
 
Protein sequence
MSIRTSLVDL TTRLVAIPSV SAEKRDLQPV IDLVVAELAD YPAALLHHRD ANGYPMLVVN 
FNQELRSDLI LNAHLDVVPA RPEQWHAFEH DGKLYGRGTQ DMKGSAAVYI EIIKEIAQLP
AEQRPNVSFQ FVTDEEIGGA NGTALLRDEG WQANLFIAGE PTNLNICHGA KGILWLAVEQ
PGVPAHGSRP WEGVNPIERL ASGLGRLYEY YPTPAQEIWR TTVTPSIIKG GDAGNRIPAN
AQLNLDIRWT PEEGADAVID NVKQAFATSS EPNPNVQILH RGTALNTPAE EPNLQRIVDA
QQSSLGRQAQ LFREHFGSDA RFYSDAGIPA VCWGPEGAGL HTDDEWVSID GLVDYYQAVK
TLLGM