Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3024 |
Symbol | |
ID | 5734881 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 3819698 |
End bp | 3820564 |
Gene Length | 867 bp |
Protein Length | 288 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641280168 |
Product | LmbE family protein |
Protein accession | YP_001545790 |
Protein GI | 159899543 |
COG category | [S] Function unknown |
COG ID | [COG2120] Uncharacterized proteins, LmbE homologs |
TIGRFAM ID | [TIGR03445] 1D-myo-inosityl-2-acetamido-2-deoxy-alpha-D-glucopyranoside deacetylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.439999 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTGACC ATCGACCAAC CATTATGCTT GTGCATGCCC ACCCTGATGA TGAAGTGATT TCAACAGGCG GCATTGCGCT GAAATATCAC GAGCAAGGAG CACGGGTCGT TTTGGTGACC TGCACCCGTG GCGAAGAAGG CGAAATTGTT GATGCTGAAT TGAATACGCC TGAGAATAAA GCGCGATTGG CCGAGATTCG CGACGTAGAG TTGGCGCGAG CGATTGCCCT GCTCAATATC ACCGATTTTC ATCAGTTGGC CTACCGCGAT TCAGGCATGA TCAATACGCC TGCCAATGAG CATCCCGAAT CGTTCAACAA GGCCGATTTG CAGGCTGCTA CTGGTCGTTT GGTGGCCTTG GTGCGCCAAT ATCAACCAGA TGTGTTGATT AGCTACGATG AAAATGGCAG TTATGGGCAT CCCGACCACA TTATGGCCCA TAAAATTACG GTTGCGGCGT TTCATGCTGC TGGCGACCCC AGCCAATTTC CTGAGGCTGG GCCAGCCTGG GAACCGAAAA AACTGTATTA CACCGCCTAT CGCCGCTCGA TTTGGCGCAA AGCGTGGGAA TATATGCAAG CCCGCGGCGA AAAAACCCCG CTCGATGATC CCGAAGCAGA TCTTTCGCGC TATGTTGATG ATCCACGCAC CACCACTGCG ATTGAAATTG GCCAATATTT GCCAATTAAA TTGCAATCGT TGTTGGAGCA TCGCACCCAA ATTGGGCCAA CATGGGGCTT TTTGAGCTTG CCAGAGGAGT TACGCGGCGA GTTACTGAGC CATGAACATT TTGCCCTGAT CGAATCACGA GTGAGTTTGC CTGAGCGCGA AGAATTTGAA ACTGATCTTT TGCTTGGGAT TCGATAA
|
Protein sequence | MTDHRPTIML VHAHPDDEVI STGGIALKYH EQGARVVLVT CTRGEEGEIV DAELNTPENK ARLAEIRDVE LARAIALLNI TDFHQLAYRD SGMINTPANE HPESFNKADL QAATGRLVAL VRQYQPDVLI SYDENGSYGH PDHIMAHKIT VAAFHAAGDP SQFPEAGPAW EPKKLYYTAY RRSIWRKAWE YMQARGEKTP LDDPEADLSR YVDDPRTTTA IEIGQYLPIK LQSLLEHRTQ IGPTWGFLSL PEELRGELLS HEHFALIESR VSLPEREEFE TDLLLGIR
|
| |