Gene Mlg_0114 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_0114 
Symbol 
ID4268201 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp126691 
End bp127611 
Gene Length921 bp 
Protein Length306 aa 
Translation table11 
GC content49% 
IMG OID638124840 
Producthypothetical protein 
Protein accessionYP_740961 
Protein GI114319278 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones57 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGAAATA GCCCGCCTGT AGAGAGAATT CGACTACCGG TTCACTTCGC CGGATTTACC 
CTCGGCGGCA TCACGCTACG ATTATCCGTA GATAAGTCGC TTTTTATTCA CGACAAAAAA
GGAAGAAGAA ACCCGGAGAG AATGGGTCCG CATGGATTTT TGGCACGGTC CAGTCCGGAA
GCACCTGAGT TCCCACGCCT TAGTCTTGAC AAAGGTTTTA TCCGGTACGT CCCACGAGTT
TATCGGCGAC ACTACATTGA CTTATCTCAA GGCTGGGAAA CCTACCTGCA CACCTTTTCC
GGAAAGTCGC GACAAAGCAT TCGCCGAAAG ATTCGTAAGT TCGAAAAGGC ATCTGAGGGA
GAGTTGGAAT GGCGGTGCTA CAAGACGGAG AATGAAATAA GAAGATTCCT TGAGTTGGCG
AGAGGAGTGG CGGACGTCAG TTACCAAAAG CGACTGCTAG GTGCTGCTTT GCCTGATAGC
CAGGAATTCT ACGATGCCGC CATTTCTTTA GCCCAGAATG ACCAAGTAAG GGGTTTTCTC
CTTTTTCATT CTGGCGATCC TGTTGCTTAC CTATACTGCC CAGCTCAGGA TGGCGTTTTA
CGATACCGTT TCTTGGGGTA CAAACCCAAC GCCGCAACTC TCTCTCCAGG AACGATATTA
CAATGGCTAG CACTAGATAA TTTGTTTAAA GAGGGACGCT TTCAATACTT CGATTTCTGC
GAGGGAGACG CCCCTCACAA AGCATTCTTC GGAAGCCATT GCAGAGTCTG TGGCGATATA
TATTGGCTTA AGCTAACTCC CAAAACCATT GCGGCCGTAG TGCTAAACTT ATCCAGCCTG
ACACTTTCCG AAGGCCTCTC CTGGTTTCTT GATCAAGCCG GATTAAAGGA CCGAATAAGA
CGATCATTGC GAGGCCGGTA A
 
Protein sequence
MRNSPPVERI RLPVHFAGFT LGGITLRLSV DKSLFIHDKK GRRNPERMGP HGFLARSSPE 
APEFPRLSLD KGFIRYVPRV YRRHYIDLSQ GWETYLHTFS GKSRQSIRRK IRKFEKASEG
ELEWRCYKTE NEIRRFLELA RGVADVSYQK RLLGAALPDS QEFYDAAISL AQNDQVRGFL
LFHSGDPVAY LYCPAQDGVL RYRFLGYKPN AATLSPGTIL QWLALDNLFK EGRFQYFDFC
EGDAPHKAFF GSHCRVCGDI YWLKLTPKTI AAVVLNLSSL TLSEGLSWFL DQAGLKDRIR
RSLRGR