Gene Mlg_0643 details

Gene Information

Locus tag: Mlg_0643
Symbol:
ID: 4270832
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 693339
End bp: 694868
Gene Length: 1530 bp
Protein Length: 509 aa
Translation table: 11
GC content: 63%
IMG OID: 638125391
Product: hypothetical protein
Protein accession: YP_741487
Protein GI: 114319804
COG category:
COG ID:
TIGRFAM ID:
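
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon. The function name `gene_length_bp` is illustrative and is not part of the record.

```python
# Values copied from the record; the helper name is hypothetical.
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


start_bp, end_bp = 693339, 694868
length_bp = gene_length_bp(start_bp, end_bp)

# Assumption: the CDS is a whole number of codons and the last codon
# is a stop codon that does not contribute an amino acid.
codons = length_bp // 3
protein_aa = codons - 1

print(length_bp, protein_aa)
```

Under these assumptions the result agrees with the listed gene length (1530 bp) and protein length (509 aa).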


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value: 0.0169115
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCAACA TTGTCGAAGC CTTCACCGAC CCGCAGTTGC TAGGCCAGTC GTTTGGCGAG 
GATACCCGCG AGGCTTGGCG TGCTGTCCTG TCAGGCGCGT TCGCCCTGCC GATGGATGAT
GACCGCCTTG CCCTGTTTAA GCGGCTGTCA GGCGACCGTG AGCCGCCTCA GCGGCAGGCG
CGGGAGCTAT GGGCCATTGC GGGCAGGCGC AGCGATAAGA CCCATACCGC AGCCGGCATT
GCGGTCTACC TGGCGACCAT CGGGGCGGAG CTGGACGGCA CCCTGGCCCG CCTGACCGCT
GGCGAGCGTG GTGTGGTTCA GTTGCTTGCG GTGGATCGCC AACAGGCCAA GGTTGCGCTT
GGGTACGTTC GCGGCCTGTT CGCTGATAGC CCGGTCATGT CCAGCCTGGT GGAGAAGGAG
AACACCGAGG GCGTTTTGCT CCGCAACGGC GTATCTATTG AGGTGGCCAC CAACAGTCAC
CGGGCCGTGC GTGGGCGCAC TCTGCTGGCG GCCATTCTGG ACGAATGCGC CTTCTTCAAG
GATGAGGCGA CCGCAACCCC CGATGTAGAG GTTTACCGCG CCCTGGTGCC TTCTCTGGCG
ACCACCGGGG GCATGCTTGT CGGCATTTCC AGCCCCTACG CCCGCCGGGG GCTGCTCTAC
AGCAAGTGGC GCAAGCACTA CGGCAAGCCC GGTGATGTGC TGGTGGTGCA GGGCGGAACA
CTGGACTTCA ACCCCACCCT TGATCCCCGC GTTATCGCGG AGGCGGAGCA GGATGACCCC
GAGGCGGCCA AAGCCGAATG GCATGGGCAA TTCCGTGCGG ATGTTGAGGG TTTTGTAACA
CGCGAAGCCG TAGACGCCTG TACCGTGCCG GCCCGGATCG AGCTACCGCC GGTGGCTGGC
GAGCGGTACA CGGCGTTTGT GGACCCGTCT GGCGGCTCCA AGGATGCGTT CACGCTGGCG
ATCGCCCACC AGTCTGATGG TGTGGCCGTG GTCGACGCTA TCCGCGCTCA GAAGCCCCCG
TTCAGCCCGG AGGCGGTGGT TAAAGAGTTC GCCGGTCTGC TGAAGGAATA CCGGATCAGC
AAGGTGGTGG GCGACCGTTA CGGCGGTGAG TTTCCGCGCG AGCTGTTCCG AAAACAGGGC
ATTGCCTACA AGCTATCTGA TCGGCCTAAG TCTGACCTGT ATCGTGACAT GCTCCCGTTG
CTCAATTCTG GGCGCGTGGA GTTGCTGGAC AATGGCCGAT TGCAAAACGA ATTGACGAGC
CTGGAGCGGC GCACCAGCCG CGCCGGGAAA GACTCAATAG ACCACCCGCC AAATGGCACC
GACGATGTGG TTAACGCGGT GGCCGGGTGT ATAATTGAGG CAGCAAAACC CAAAGCCGTT
CCCCGCGTGC GGCGGATGGG TGATTCGATC AGTTCCGGTT CCAATTCTGG TTCCAGCGTT
CTGGATCGAA TCGCGCAAGG TCAGAAAGCC AGCGGGAGGT TGGCGAGTAC GCGCCGAAGT
TTTCTGCGAG ATGCAGACGG TTTCGGTTGA
 
Protein sequence
MTNIVEAFTD PQLLGQSFGE DTREAWRAVL SGAFALPMDD DRLALFKRLS GDREPPQRQA 
RELWAIAGRR SDKTHTAAGI AVYLATIGAE LDGTLARLTA GERGVVQLLA VDRQQAKVAL
GYVRGLFADS PVMSSLVEKE NTEGVLLRNG VSIEVATNSH RAVRGRTLLA AILDECAFFK
DEATATPDVE VYRALVPSLA TTGGMLVGIS SPYARRGLLY SKWRKHYGKP GDVLVVQGGT
LDFNPTLDPR VIAEAEQDDP EAAKAEWHGQ FRADVEGFVT REAVDACTVP ARIELPPVAG
ERYTAFVDPS GGSKDAFTLA IAHQSDGVAV VDAIRAQKPP FSPEAVVKEF AGLLKEYRIS
KVVGDRYGGE FPRELFRKQG IAYKLSDRPK SDLYRDMLPL LNSGRVELLD NGRLQNELTS
LERRTSRAGK DSIDHPPNGT DDVVNAVAGC IIEAAKPKAV PRVRRMGDSI SSGSNSGSSV
LDRIAQGQKA SGRLASTRRS FLRDADGFG
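
The relationship between the gene sequence and the protein sequence can be reproduced with a short script. This is a minimal sketch rather than the method the database itself uses. It relies on the fact that the codon-to-amino-acid assignments of translation table 11 match the standard genetic code, and it does not handle the alternative start codons that table 11 allows (this gene starts with ATG, so that is not needed here). The helper names `clean`, `translate`, and `gc_percent` are illustrative. Both functions accept the sequence exactly as printed, in space-separated blocks of ten.

```python
# Codon table built in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of bases that are G or C."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence shown above.
first_blocks = "ATGACCAACA TTGTCGAAGC CTTCACCGAC"
print(translate(first_blocks))  # MTNIVEAFTD
```

Passing the full gene sequence to `translate` and `gc_percent` lets the output be compared with the protein sequence and the GC content field listed in the record.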