Gene Haur_3971 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3971 
Symbol 
ID5735832 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp5048908 
End bp5049978 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content57% 
IMG OID641281121 
ProductFkbH like protein 
Protein accessionYP_001546731 
Protein GI159900484 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3882] Predicted enzyme involved in methoxymalonyl-ACP biosynthesis 
TIGRFAM ID[TIGR01681] HAD-superfamily phosphatase, subfamily IIIC
[TIGR01686] FkbH-like domain 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.692918 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCGTGC AGCAGACCGC CGAGCTGATC AAAGAAGAGA AAAAAAACAA ATCGGTCAAG 
TGTGTCGTCT GGGATCTCGA CAACACGGTG TGGAAGGGCA TCCTGCTCGA AGACGAGCAT
GTCACGCCCT TCCCCAACGT CGTCGCGGTG ATCAAAGAGC TTGACAGCCG CGGCATCCTC
AACTCGATCT CAAGCCGCAA CGACCATGAC CTAGCCGTGG CCAAGCTCGA GGAGCTGGGC
CTGCTTGAGT ATTTCCTGTA CCCTCAGATC AACTGGAACT CCAAGGCCTC CTCGATCAAG
GAGATTGCCA AGCTGATTAA CATCGGCCTC GATACCTTCG CCTTCGTTGA CGACCAGCCC
TTCGAGCGCG AAGAAGTCAC CTTCGAAATC CCCGACATTC TGTGCATCGA CGCGCTCGAC
GGCGAGAAAA TCCTCGACAT GCCGGAGATG ATGCCGCGCT TTATCACCGA GGACTCCCGG
CTGCGCCGCC AGATGTACCA GAGTGATATC TCGCGCAACG GGGCCGAGCA AGAGTTTCAA
GGCTCCAACG AAGAGTTCCT GGCTACGCTG AAGATGGTCT TCACGCTGGC CCCGGCCCAG
GAGGATGACC TGCAACGGGC CGAAGAGCTG ACCCTGCGCA CCAACCAGCT CAACACCACC
GGCTACACCT ACTCTTACGA CGAACTCAAC GCGTTCCGCC ACTCGGGCCG CCACAAGCTC
TACATCGCCT CGCTGGACGA TAAGTACGGC ACCTACGGCA AAATCGGCCT GACTCTGGTC
GAGTGCGGCG AGGAGATTTG GACAATCAAG CTGCTGCTGA TGTCCTGCCG GGTCATGTCG
CGGGGCGTGG GCACGATTAT GATCAACCAC GTTATGAACG AGTGCAAGCG GGCCGGGCGC
CGCCTCCAAG CCGAGTTCGT TTCCAACAAC CGCAACCGCA TGATGTATAT CACCTATAAG
TTCGGCGGCT TCAACGAGGT CAACCGGATC GATGATCTGG TGATTTTCGA GAACGATCTG
TCCAATATCC AACCGTTCCC GGAGTACGTC AAGGTCAACA TTCTGGATTA G
 
Protein sequence
MAVQQTAELI KEEKKNKSVK CVVWDLDNTV WKGILLEDEH VTPFPNVVAV IKELDSRGIL 
NSISSRNDHD LAVAKLEELG LLEYFLYPQI NWNSKASSIK EIAKLINIGL DTFAFVDDQP
FEREEVTFEI PDILCIDALD GEKILDMPEM MPRFITEDSR LRRQMYQSDI SRNGAEQEFQ
GSNEEFLATL KMVFTLAPAQ EDDLQRAEEL TLRTNQLNTT GYTYSYDELN AFRHSGRHKL
YIASLDDKYG TYGKIGLTLV ECGEEIWTIK LLLMSCRVMS RGVGTIMINH VMNECKRAGR
RLQAEFVSNN RNRMMYITYK FGGFNEVNRI DDLVIFENDL SNIQPFPEYV KVNILD