Gene Information |
Locus tag | Haur_3971 |
Symbol | |
ID | 5735832 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 5048908 |
End bp | 5049978 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 641281121 |
Product | FkbH like protein |
Protein accession | YP_001546731 |
Protein GI | 159900484 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3882] Predicted enzyme involved in methoxymalonyl-ACP biosynthesis |
TIGRFAM ID | [TIGR01681] HAD-superfamily phosphatase, subfamily IIIC; [TIGR01686] FkbH-like domain |
|
|
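The coordinate and length fields above agree with one another, which the short sketch below checks. It assumes that the start and end positions are 1-based and inclusive, and that the CDS ends with a stop codon that is not counted in the protein length. The function names are illustrative and do not come from the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


# Values taken from the record for Haur_3971.
length_bp = gene_length(5048908, 5049978)
print(length_bp)                  # 1071
print(protein_length(length_bp))  # 356
```

Both results match the listed Gene Length (1071 bp) and Protein Length (356 aa).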
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.692918 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCGTGC AGCAGACCGC CGAGCTGATC AAAGAAGAGA AAAAAAACAA ATCGGTCAAG TGTGTCGTCT GGGATCTCGA CAACACGGTG TGGAAGGGCA TCCTGCTCGA AGACGAGCAT GTCACGCCCT TCCCCAACGT CGTCGCGGTG ATCAAAGAGC TTGACAGCCG CGGCATCCTC AACTCGATCT CAAGCCGCAA CGACCATGAC CTAGCCGTGG CCAAGCTCGA GGAGCTGGGC CTGCTTGAGT ATTTCCTGTA CCCTCAGATC AACTGGAACT CCAAGGCCTC CTCGATCAAG GAGATTGCCA AGCTGATTAA CATCGGCCTC GATACCTTCG CCTTCGTTGA CGACCAGCCC TTCGAGCGCG AAGAAGTCAC CTTCGAAATC CCCGACATTC TGTGCATCGA CGCGCTCGAC GGCGAGAAAA TCCTCGACAT GCCGGAGATG ATGCCGCGCT TTATCACCGA GGACTCCCGG CTGCGCCGCC AGATGTACCA GAGTGATATC TCGCGCAACG GGGCCGAGCA AGAGTTTCAA GGCTCCAACG AAGAGTTCCT GGCTACGCTG AAGATGGTCT TCACGCTGGC CCCGGCCCAG GAGGATGACC TGCAACGGGC CGAAGAGCTG ACCCTGCGCA CCAACCAGCT CAACACCACC GGCTACACCT ACTCTTACGA CGAACTCAAC GCGTTCCGCC ACTCGGGCCG CCACAAGCTC TACATCGCCT CGCTGGACGA TAAGTACGGC ACCTACGGCA AAATCGGCCT GACTCTGGTC GAGTGCGGCG AGGAGATTTG GACAATCAAG CTGCTGCTGA TGTCCTGCCG GGTCATGTCG CGGGGCGTGG GCACGATTAT GATCAACCAC GTTATGAACG AGTGCAAGCG GGCCGGGCGC CGCCTCCAAG CCGAGTTCGT TTCCAACAAC CGCAACCGCA TGATGTATAT CACCTATAAG TTCGGCGGCT TCAACGAGGT CAACCGGATC GATGATCTGG TGATTTTCGA GAACGATCTG TCCAATATCC AACCGTTCCC GGAGTACGTC AAGGTCAACA TTCTGGATTA G
|
Protein sequence | MAVQQTAELI KEEKKNKSVK CVVWDLDNTV WKGILLEDEH VTPFPNVVAV IKELDSRGIL NSISSRNDHD LAVAKLEELG LLEYFLYPQI NWNSKASSIK EIAKLINIGL DTFAFVDDQP FEREEVTFEI PDILCIDALD GEKILDMPEM MPRFITEDSR LRRQMYQSDI SRNGAEQEFQ GSNEEFLATL KMVFTLAPAQ EDDLQRAEEL TLRTNQLNTT GYTYSYDELN AFRHSGRHKL YIASLDDKYG TYGKIGLTLV ECGEEIWTIK LLLMSCRVMS RGVGTIMINH VMNECKRAGR RLQAEFVSNN RNRMMYITYK FGGFNEVNRI DDLVIFENDL SNIQPFPEYV KVNILD
|
| |
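The sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that spacing, compute GC content, and translate the gene so it can be compared with the listed protein. It is a minimal illustration, not the database's own pipeline: the helper names are hypothetical, the codon table is written out by hand, and it relies on translation table 11 sharing the standard table's codon-to-amino-acid assignments (the two differ in which codons may act as start codons, which does not matter here because the gene begins with ATG).

```python
# Codon table in the conventional TCAG ordering; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing used in the record and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three blocks and last 21 bases of the gene sequence in the record.
print(translate("ATGGCCGTGC AGCAGACCGC CGAGCTGATC"))  # MAVQQTAELI
print(translate("AAGGTCAACA TTCTGGATTA G"))           # KVNILD
print(gc_percent("ATGGCCGTGC"))                       # 70.0
```

The two translated fragments match the start (MAVQQTAELI) and end (KVNILD) of the listed protein sequence. Passing the full gene sequence to `gc_percent` and `translate` would allow the same comparison against the GC content and protein fields of the record.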