Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_2301 |
Symbol | |
ID | 4268399 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | + |
Start bp | 2612546 |
End bp | 2613859 |
Gene Length | 1314 bp |
Protein Length | 437 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 638127061 |
Product | HipA domain-containing protein |
Protein accession | YP_743133 |
Protein GI | 114321450 |
COG category | [R] General function prediction only |
COG ID | [COG3550] Uncharacterized protein related to capsule biosynthesis enzymes |
TIGRFAM ID | [TIGR03071] HipA N-terminal domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.752319 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 0.112429 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCCGGC GTCGGCGCCA CCCTCCGCTA CACGTCCTGC TGAACAACCG CCACGTCGGC CAGCTCCAAA AGGCGGTGGA CGGTGCAATC AGTTTTACCT ACGAGCAAAA CTGGCTGGAC TGGGACCACG CCCTACCCGT CTCGCTCTCC CTTCCCCTCC GCGAAGACCC CTACCGGGGT GCACCGGTGG CGGCAGTGTT CGACAACCTC CTGCCCGATG CCGAGCCGCT CCGCCGCCGC GTTGCCGAGC GCGTTGGCGC CGAGGGCACC GACGCCTACA GCCTGCTCTC AGCCATCGGC CATGATTGCG TCGGTGCCCT GCAATTCGTC GGCCCCGACG CCCCGGCCCC CGGCGACACC ACCCAAATTT CCGGCCAGGT CATTGACGAG GACGACATCG GGAGGCTGCT CCGGGGGCTG GCCCAGGCCC CACTGGGGCT GGACCGCGAC GAGGCATTCC GGATCTCCAT TGCCGGGGTG CAGGAGAAGA CCGCCCTGCT CCGGCATGAA GGCCGCTGGC TAAAACCCCA CGGCACAACC CCGACCAGCC ACATCCTCAA GCCTCAGATC GGCCAGTTGC CGAACGGCAT CGACCTGTCC AACAGCGTCG AGAACGAATA CTACTGCCTC AAACTGGCTG CCGCCTTCGG GTTGCCCGTC AACAGGGCCG AGATCCACAC CTTTGGGCCC ACCCAAGCAC TCGTCGTCGA GCGCTTCGAT CGCCACTGGA CCCACGACGG CCGCCTGCTC AGACTCCCGC AGGAGGACTG CTGCCAGGCC CTATCCGTCC CGCCAACACG CAAGTACCAG ACCGAGGGTG GCCCCGGTAT CGTGCAACTT CTTGAACTGC TCAACGGCAG CGACACCCCG GCCAAAGACC AGGCGACCGT CTTCAAGGCA CAGATCTTCT TCTGGCTGAT CGGCGCTACC GACGGGCACG CAAAGAACTT CAGCCTGTTT CTGCGGCCAC AGGGCGCGTT TCGCCTGACC CCGCTGTACG ACATCCTGAC CGTCCAGCCG AGCCTTGCCG GCCGGCAAAT CGAACGCAAG CAGATGAAAC TGGCCATGGC CGTGGGGCGC GGAAACCGCT ACCGGATCCA TGAAATCCAG GGCCGCCATT TCCTACAGAC CGGCGCCGCC GCCCGCCTGC CGCGCACCTT GGCCACCAAC GTCATCGAGG ACATAGTGAC CCGCGCGGAC AACGCCATCA CGCAGGTCGA AAGCGCCTTG CCCCCCGACT TCCCCCCGGC AATCCACGAA AGCGTGAAGG CGGCCATCGC CGGGCGCCTG GGGGTATTGC AGAGGGCGGG GTGA
|
Protein sequence | MPRRRRHPPL HVLLNNRHVG QLQKAVDGAI SFTYEQNWLD WDHALPVSLS LPLREDPYRG APVAAVFDNL LPDAEPLRRR VAERVGAEGT DAYSLLSAIG HDCVGALQFV GPDAPAPGDT TQISGQVIDE DDIGRLLRGL AQAPLGLDRD EAFRISIAGV QEKTALLRHE GRWLKPHGTT PTSHILKPQI GQLPNGIDLS NSVENEYYCL KLAAAFGLPV NRAEIHTFGP TQALVVERFD RHWTHDGRLL RLPQEDCCQA LSVPPTRKYQ TEGGPGIVQL LELLNGSDTP AKDQATVFKA QIFFWLIGAT DGHAKNFSLF LRPQGAFRLT PLYDILTVQP SLAGRQIERK QMKLAMAVGR GNRYRIHEIQ GRHFLQTGAA ARLPRTLATN VIEDIVTRAD NAITQVESAL PPDFPPAIHE SVKAAIAGRL GVLQRAG
|
| |