Gene Mlg_1797 details


Gene Information

Locus tag: Mlg_1797
Symbol:
ID: 4268716
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 2051890
End bp: 2053017
Gene Length: 1128 bp
Protein Length: 375 aa
Translation table: 11
GC content: 46%
IMG OID: 638126553
Product: hypothetical protein
Protein accession: YP_742631
Protein GI: 114320948
COG category:
COG ID:
TIGRFAM ID:
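
The coordinate and length fields above are mutually consistent, which can be checked with a minimal sketch. It assumes the start and end positions are 1-based and inclusive (the record does not state this, but the reported gene length agrees with it) and that the CDS ends in a single stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp, has_stop_codon=True):
    """Number of amino acids encoded by a CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default,
    that the last codon is a stop codon that is not translated.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(2051890, 2053017)
print(length)                     # 1128
print(protein_length_aa(length))  # 375
```

Both values match the Gene Length and Protein Length fields reported in the record.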


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 52
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTTGAGG GCTGCTTAAT GTTAAGAAAT GAACTAGAAA CCGCAAAGCG ATCAGTTGTT 
ACCGATAGTT TGCAAATCTC CATTGGGGAG ATCGCTTCCA TGTATGCGGC GGAGGAGCTA
GATATTATCC CTGATTTTCA GAGATTATTT CGTTGGTCTC CTGAAAAGAA GACGGCTTTT
ATCGAATCGA TTTTGATTGG AATCCCTGTG CCCCCGGCTT TCGCATATGA AAAGCCAGAT
GGAACATGGG AGCTGATAGA CGGGTTACAA CGTATCTCCA CAGTGCTCGA GTTCATGGGC
GTTTTGCGCG AACCAGATAA TCCTGCAGAG AAGATGCCGC CTTCTACTCT GACAACTGCA
ACCTATCTTC CGTCCTTAGA TGGTGTCGTG TGGCCAACAG AAGGAGGAGA TGGAGGTGCT
CAAGTTCTTG AAAAATCTCT TCAGCTTTTC TTTCGTCGGT CACGCTTGGA TTTCCAGATA
CTTAAACACC CAAGTGACGC AAAAACGAAA TTTGATTTGT TTCAGCGATT GAATCGGGGT
GGCGAGTACG CTAATGAGCA GGAAGTCCGC ACTTGTTCAA TGGTGCTTGG AAATGCAGAT
GCTACCGCTC GAATAAGGAA TCTGGCAAGA AGCGAAGAAT TTATAAATAT TTTTAAAATT
ACGGAAGAGC AGAATTTAAA ACAAAAAGAT GTTGAATATG CGGTTCGTGC TATTGTTCAT
ACAGTCGAAG ATTTTGGCTC AGATGCTGAT GTTCAGGAGT TCCTCGACCG AAGTATTGCG
CGAATTATCG TGGATCAAGA TCCGAATGAT GTGATCCATA CGGTGGAATG GGCCGTCAAT
AGCTTGCATA GCTTATTCGG GGGCGATGCC TTGATTCCTC ATGACGAGGC TTATCAAGGA
ATAGCCAAAA GATTTTCTCT GCGTGCACTG GAAGCGATAT TGGTAGGGGT CGCCAGAAAC
AAAGAAAAAA TTCAGGACTT GGAAGACCCG GATGGTTTTT TGCGTGAGCG CGTCGATCGG
TTCTGGCGAG AAAGAGATGT CGCCGAGTTG AGTGCGTCAG GCTTGCGAGG GACTACTCGG
ATCCAAAGAA CCGTACCATT TGGTGAGAAT TGGTTTTCTC CACAATGA
 
Protein sequence
MLEGCLMLRN ELETAKRSVV TDSLQISIGE IASMYAAEEL DIIPDFQRLF RWSPEKKTAF 
IESILIGIPV PPAFAYEKPD GTWELIDGLQ RISTVLEFMG VLREPDNPAE KMPPSTLTTA
TYLPSLDGVV WPTEGGDGGA QVLEKSLQLF FRRSRLDFQI LKHPSDAKTK FDLFQRLNRG
GEYANEQEVR TCSMVLGNAD ATARIRNLAR SEEFINIFKI TEEQNLKQKD VEYAVRAIVH
TVEDFGSDAD VQEFLDRSIA RIIVDQDPND VIHTVEWAVN SLHSLFGGDA LIPHDEAYQG
IAKRFSLRAL EAILVGVARN KEKIQDLEDP DGFLRERVDR FWRERDVAEL SASGLRGTTR
IQRTVPFGEN WFSPQ
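
The gene and protein sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how such blocks could be cleaned, translated, and checked for GC content. The helper names (`clean`, `translate`, `gc_percent`) are hypothetical. The codon table is the standard one, whose codon-to-amino-acid assignments are shared by translation table 11; table 11 differs only in the alternative start codons it permits, which this sketch does not model.

```python
from itertools import product

# Standard codon assignments (shared by translation table 11),
# listed in TCAG order for the first, second, and third base.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq_text):
    """Remove the spaces and line breaks used to format the sequence blocks."""
    return "".join(seq_text.split()).upper()


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in a DNA string."""
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks and last two blocks of the gene sequence above.
head = clean("ATGCTTGAGG GCTGCTTAAT GTTAAGAAAT")
tail = clean("TGGTTTTCTC CACAATGA")
print(translate(head))  # MLEGCLMLRN
print(translate(tail))  # WFSPQ
```

The two translated fragments agree with the first and last blocks of the protein sequence. Passing the full cleaned gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein and the reported GC content.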