Gene Mlg_0647 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_0647 
Symbol 
ID4270837 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp698391 
End bp699566 
Gene Length1176 bp 
Protein Length391 aa 
Translation table11 
GC content68% 
IMG OID638125396 
Producthypothetical protein 
Protein accessionYP_741491 
Protein GI114319808 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.268839 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value0.0829547 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAAGC CCGCCCCCAC CAACGAAGCC CCCGACCTCT CCGGACACCC CCGGGCCGGG 
GACGTGCGCC GCTTCGCCAC CGGCGAGGCG CTGGTCCTCG CGCCGCTGCC ATTACCGCTC
CCGGTGCGCC CCATGGACCC GAAGCAGTTC CTCGTCCACA TCGACCAGAC CTACCTGGAT
CTGGACTCCA GCCGCCACAC CGCCCAAGCC CACCAGGTCA TGATCGGCGT GCCCTTCTTC
ATCGGCGTGA TCTTTATCGG GCTCGGTTTT CCGTTGCTTA TTGGCACGAC CGCGTTCGGG
ACCGACCACA CCTTTTGGGC AAGCGCCCTG CATGCCGCCA TTGTCTCCAT CCCCTACGGC
CTTTTCGGCG GCACCCTGTT GTTTCTCATT GCCCTCCACG GCTTTTTCCA CCGCATGAAG
CAGGCCCGGC GGCATCCGCC GGTGCGCTTC CATCGCCAGC GCCGGGAGGT CGCCTGCTTC
GACCCCGACA CCGGCCAGAC CCTCGTCGCC CCGTTCGAGC GCGTCACCGC CTGGATGGCC
ACCAGCAGCG GCGCCACCCC CTACGGCGCC ATGACCCACT ACAACTTCGG CCTCACCGTC
GAGGACGCGG AAACCGGACA GTCCTATACC GCCCTCTTCC CCGCCTCGCT CCCCGAGGAG
GCCCTGGGCC TGTGGGAGGC CATCCGCCGC TACATGGATC ACGGGCCGGG CACGCTCGAA
CGGCCCACGA AAACCTTCTC CGGCTTGCCC ATCGACCCCA GGGAGCACCT CCCCTACGAC
GGCGTCCACA CCCTCGAGAT CGCCCGCAAG AAACTCCACG AAGACCTTCG TGATGGCTTC
ACCAGCCGGG TCTTCGTCTT CTTCTGGTAC CTCTACCACC TGATCACCTT CTGGAAGCTG
CCCTTCCGGC TGGCCACCTG GGAATACCAC CGGAGCCGCG CGCCCATTCC CCCCGAGATC
CAGGCCTGGT CCGAACCCAT CCCGGAGCAC GACTGGGCCA CGCCCAGCCC CGAACTGGAG
GCCGCCGCCC GGCGCATGGT GCAGGCCGGC GAGCAAGCCC CCGACATCAA ACTCCCCGAG
CTGCTCGCCG CCGGCATCGC CGACTGGCAC CCGGACCACG ACGGCGGCAA CGGCAACGGC
AACGGCAACA ACGAGCGGAC CCCACGGAAA CCATGA
 
Protein sequence
MSKPAPTNEA PDLSGHPRAG DVRRFATGEA LVLAPLPLPL PVRPMDPKQF LVHIDQTYLD 
LDSSRHTAQA HQVMIGVPFF IGVIFIGLGF PLLIGTTAFG TDHTFWASAL HAAIVSIPYG
LFGGTLLFLI ALHGFFHRMK QARRHPPVRF HRQRREVACF DPDTGQTLVA PFERVTAWMA
TSSGATPYGA MTHYNFGLTV EDAETGQSYT ALFPASLPEE ALGLWEAIRR YMDHGPGTLE
RPTKTFSGLP IDPREHLPYD GVHTLEIARK KLHEDLRDGF TSRVFVFFWY LYHLITFWKL
PFRLATWEYH RSRAPIPPEI QAWSEPIPEH DWATPSPELE AAARRMVQAG EQAPDIKLPE
LLAAGIADWH PDHDGGNGNG NGNNERTPRK P