Gene Mlg_0646 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_0646 
Symbol 
ID4270836 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp697261 
End bp698394 
Gene Length1134 bp 
Protein Length377 aa 
Translation table11 
GC content66% 
IMG OID638125395 
Producthypothetical protein 
Protein accessionYP_741490 
Protein GI114319807 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.436257 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value0.112784 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAAGC CCGCCCCCAC CAACGAAGCC CCCGACCTCT CCGGACATCC CCGGGCCGGG 
GACGTGCGCC GCTTCGCCAC CGGCGAGGCG CTGGTCCTCG CGCCGCTGCC ATTACCGCTC
CCGGTGCGCC CCATGGACCC GAAGCAGTTC CTCGTCCACA TCGACCAGAC CTACCTGGAT
CTGGACTCCA GCCGCCACAC CGCCCAAGCC CACCAGGTCA TGATCGGCAT GCCCTTCTTC
ATCGGCGTGA TCTTTATCGG GCTCGGTGCA CCGATGCTTA TCGGCGCGGC CGGACTCGTG
TATGGCGAAC CATTCTGGGC AAATGCCCTC TATGCCGCCA TAGTCTCCAT CCCCTACGGC
CTTTTCGGCG GCACCCTGTT GTTTCTGATC CTCCTCTACG GCTTCGTCGA CCGCATGAAG
CAGGCCCGGC GGCATCCCCC GGTGCGCTTC CATCGCCAGC GCCGGGAGGT CGCCTGCTTC
GACCCCGAAA CCGGCCAAAC CCTCGTCGCC CCGTTCGAGC GCGTCACCGC CTGGATGGCC
ACCAGCAGCG GCGCCACCCC CTACGGCGCC ATGACCCACT ACAACTTCGG CCTCACCGTC
GAGGACGCGG AAACCGGACA GTCCTATACC GCCCTCTTCC CCGCCTCGCT CCCCGAGGAG
GCCCTGGGCC TGTGGGAGGC CATCCGCCGC TACATGGATC ACGGGCCGGG CACGCTCGAA
CGGCCCACGA AAACCTTCTC CGGCTTGCCC ATCGACCCCA GGGAGCACCT CCCCTACGAC
GGCGTCCACA CCCTCGAGAT CGCCCGCAAG AAACTCCACG AAGACCTTCG TGATGGCTTC
ACCAGCCGGG TCTTCGTCTT CTTCTGGTAC CTCTACCACC TGATCACCTT CTGGAAGCTG
CCCTTCCGGC TGGCCACCTG GGAATACCAC CAGAGCCGCG CACCCATTCC CCCCGAGATC
CAGGCCTGGT CCGAACCCAT CCCGGAGCAC GACTGGGCCA CGCCCAGCCC CGAACTGGAG
GCCGCCGCCC GGCGCATGGT GCAGGCCGGC GAGCAAGACC CCGACATCAA ACTCCCCGAG
CTGCTCGCCG CCGGCAGATC CTGCAAACTT CAAAAGGTGC GTATGCATCC ATGA
 
Protein sequence
MSKPAPTNEA PDLSGHPRAG DVRRFATGEA LVLAPLPLPL PVRPMDPKQF LVHIDQTYLD 
LDSSRHTAQA HQVMIGMPFF IGVIFIGLGA PMLIGAAGLV YGEPFWANAL YAAIVSIPYG
LFGGTLLFLI LLYGFVDRMK QARRHPPVRF HRQRREVACF DPETGQTLVA PFERVTAWMA
TSSGATPYGA MTHYNFGLTV EDAETGQSYT ALFPASLPEE ALGLWEAIRR YMDHGPGTLE
RPTKTFSGLP IDPREHLPYD GVHTLEIARK KLHEDLRDGF TSRVFVFFWY LYHLITFWKL
PFRLATWEYH QSRAPIPPEI QAWSEPIPEH DWATPSPELE AAARRMVQAG EQDPDIKLPE
LLAAGRSCKL QKVRMHP