Gene Mlg_2301 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_2301 
Symbol 
ID4268399 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp2612546 
End bp2613859 
Gene Length1314 bp 
Protein Length437 aa 
Translation table11 
GC content66% 
IMG OID638127061 
ProductHipA domain-containing protein 
Protein accessionYP_743133 
Protein GI114321450 
COG category[R] General function prediction only 
COG ID[COG3550] Uncharacterized protein related to capsule biosynthesis enzymes 
TIGRFAM ID[TIGR03071] HipA N-terminal domain 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.752319 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value0.112429 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCCGGC GTCGGCGCCA CCCTCCGCTA CACGTCCTGC TGAACAACCG CCACGTCGGC 
CAGCTCCAAA AGGCGGTGGA CGGTGCAATC AGTTTTACCT ACGAGCAAAA CTGGCTGGAC
TGGGACCACG CCCTACCCGT CTCGCTCTCC CTTCCCCTCC GCGAAGACCC CTACCGGGGT
GCACCGGTGG CGGCAGTGTT CGACAACCTC CTGCCCGATG CCGAGCCGCT CCGCCGCCGC
GTTGCCGAGC GCGTTGGCGC CGAGGGCACC GACGCCTACA GCCTGCTCTC AGCCATCGGC
CATGATTGCG TCGGTGCCCT GCAATTCGTC GGCCCCGACG CCCCGGCCCC CGGCGACACC
ACCCAAATTT CCGGCCAGGT CATTGACGAG GACGACATCG GGAGGCTGCT CCGGGGGCTG
GCCCAGGCCC CACTGGGGCT GGACCGCGAC GAGGCATTCC GGATCTCCAT TGCCGGGGTG
CAGGAGAAGA CCGCCCTGCT CCGGCATGAA GGCCGCTGGC TAAAACCCCA CGGCACAACC
CCGACCAGCC ACATCCTCAA GCCTCAGATC GGCCAGTTGC CGAACGGCAT CGACCTGTCC
AACAGCGTCG AGAACGAATA CTACTGCCTC AAACTGGCTG CCGCCTTCGG GTTGCCCGTC
AACAGGGCCG AGATCCACAC CTTTGGGCCC ACCCAAGCAC TCGTCGTCGA GCGCTTCGAT
CGCCACTGGA CCCACGACGG CCGCCTGCTC AGACTCCCGC AGGAGGACTG CTGCCAGGCC
CTATCCGTCC CGCCAACACG CAAGTACCAG ACCGAGGGTG GCCCCGGTAT CGTGCAACTT
CTTGAACTGC TCAACGGCAG CGACACCCCG GCCAAAGACC AGGCGACCGT CTTCAAGGCA
CAGATCTTCT TCTGGCTGAT CGGCGCTACC GACGGGCACG CAAAGAACTT CAGCCTGTTT
CTGCGGCCAC AGGGCGCGTT TCGCCTGACC CCGCTGTACG ACATCCTGAC CGTCCAGCCG
AGCCTTGCCG GCCGGCAAAT CGAACGCAAG CAGATGAAAC TGGCCATGGC CGTGGGGCGC
GGAAACCGCT ACCGGATCCA TGAAATCCAG GGCCGCCATT TCCTACAGAC CGGCGCCGCC
GCCCGCCTGC CGCGCACCTT GGCCACCAAC GTCATCGAGG ACATAGTGAC CCGCGCGGAC
AACGCCATCA CGCAGGTCGA AAGCGCCTTG CCCCCCGACT TCCCCCCGGC AATCCACGAA
AGCGTGAAGG CGGCCATCGC CGGGCGCCTG GGGGTATTGC AGAGGGCGGG GTGA
 
Protein sequence
MPRRRRHPPL HVLLNNRHVG QLQKAVDGAI SFTYEQNWLD WDHALPVSLS LPLREDPYRG 
APVAAVFDNL LPDAEPLRRR VAERVGAEGT DAYSLLSAIG HDCVGALQFV GPDAPAPGDT
TQISGQVIDE DDIGRLLRGL AQAPLGLDRD EAFRISIAGV QEKTALLRHE GRWLKPHGTT
PTSHILKPQI GQLPNGIDLS NSVENEYYCL KLAAAFGLPV NRAEIHTFGP TQALVVERFD
RHWTHDGRLL RLPQEDCCQA LSVPPTRKYQ TEGGPGIVQL LELLNGSDTP AKDQATVFKA
QIFFWLIGAT DGHAKNFSLF LRPQGAFRLT PLYDILTVQP SLAGRQIERK QMKLAMAVGR
GNRYRIHEIQ GRHFLQTGAA ARLPRTLATN VIEDIVTRAD NAITQVESAL PPDFPPAIHE
SVKAAIAGRL GVLQRAG