Gene Haur_1044 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_1044 
Symbol 
ID5732948 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp1191596 
End bp1192603 
Gene Length1008 bp 
Protein Length335 aa 
Translation table11 
GC content52% 
IMG OID641278179 
Productalpha/beta hydrolase fold 
Protein accessionYP_001543820 
Protein GI159897573 
COG category[R] General function prediction only 
COG ID[COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily)  
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATATAA AACCACAAAA ACGCTCGCTG TTACGACGGA TTGGGCGCTG GCTCGCTTGG 
CTTGGCCTGT TGATTGTTGG TTTGCTGATC GGCGGTTGGG CGTTTCAGCG CTGGGCCAGC
CAGCGTGATC GCCAACAATT TTTGCCAGCC GAGCAGCAAA TTATGCTCAA TGGCCATGCG
ATGCGACTGA TTTGTATGGG CAGCGGCAGC CCAACGATCG TGCTCGAATC TGGCTTAGGC
GATGGTGCTG ATGTTTGGGG CTTAGTCCAA CCTGCCTTAG CCGAGCAATA TCGGGTTTGT
GCCTATGATC GAGTTGGCAT GGGCTGGAGT GCAGCGGTAG CCAACAAGGC TGATCGGGCT
TCGATTGCCC AAACCTTGCA TGAACTGCTG AGCCAAGCCA ACGTATCAGC GCCATATGTA
TTGGTTGGCC ATTCGGCTGG TGGTTTGTAT GTGCGCGAAT ATGCCCAGCG CTACCCTGAG
CAAGTTATTG GTTTGGTGCT GGTCGATTCA TCGCACGAAC AACAACGCCA ACGTCAACCA
CAGCTTGCTG AAGATCCATT TGCAATCATG CGTCAGTCGA TGCAAGCCTG TGATGCCTTA
GCGCCATTCG GAATTATTCG GCTGACAAAG CTGTTTGAGC AATCGCAATC GACCTATGCC
AAACTTCCAC AACCAGCTCA AGCCTCGATT GCAGCTAGCC AATACCAAAC GAGCACCTGT
AGCGCGATGG ATGCGGCCTT GGCAGCAATC ACCCAAGATC TGAATCAAGC CCAAGCTCCG
CAATCGCTAA AGGATCTCCC GTTGGTGGTA TTAACCCGTG GGATTGCTGA TAGCACCATG
CCAGCGGAAT TTGAACAGAC GTGGGATAGC TTGCAACAAG AATTAGCTCA GCTTTCGAGC
AACAGCCAAC ATCATATAGC TGAAACCAGT GGTCATTACA TTCATCTTGA TCAACCAGCG
TTGGTGATCG AGGCAGTTGA ATGGGTAATC AGCCAACAAG CTAAATAG
 
Protein sequence
MNIKPQKRSL LRRIGRWLAW LGLLIVGLLI GGWAFQRWAS QRDRQQFLPA EQQIMLNGHA 
MRLICMGSGS PTIVLESGLG DGADVWGLVQ PALAEQYRVC AYDRVGMGWS AAVANKADRA
SIAQTLHELL SQANVSAPYV LVGHSAGGLY VREYAQRYPE QVIGLVLVDS SHEQQRQRQP
QLAEDPFAIM RQSMQACDAL APFGIIRLTK LFEQSQSTYA KLPQPAQASI AASQYQTSTC
SAMDAALAAI TQDLNQAQAP QSLKDLPLVV LTRGIADSTM PAEFEQTWDS LQQELAQLSS
NSQHHIAETS GHYIHLDQPA LVIEAVEWVI SQQAK