Gene Mlg_1746 details

Gene Information

Locus tag: Mlg_1746
Symbol:
ID: 4270853
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 2003003
End bp: 2004184
Gene Length: 1182 bp
Protein Length: 393 aa
Translation table: 11
GC content: 71%
IMG OID: 638126504
Product: hypothetical protein
Protein accession: YP_742582
Protein GI: 114320899
COG category:
COG ID:
TIGRFAM ID:
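
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the reported protein length excludes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 2003003
end_bp = 2004184
gene_length_bp = 1182
protein_length_aa = 393

# Assumption: coordinates are 1-based and inclusive, so both ends are counted.
span = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted as a residue.
codons = span // 3

assert span == gene_length_bp
assert span % 3 == 0
assert codons - 1 == protein_length_aa
```

Under these assumptions the three fields agree with one another: 1182 bp corresponds to 394 codons, one of which is the stop codon.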


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.681924
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 40
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGGCCGACA GCCCCGAGAT CGAGCTGCAG TTCCAGGAAG GCTGCGACGA CTGCGGTCGC 
CGGGAGGTCC GCCTGCCGCC ACGGCTGCCG GCGCTGGGCG ATGACTTCGA CTGGGACCTG
CGGGACTACG ACGGCTTCCG CCTGTTCATG CTCGAGGAGC TGGCCGCGCG CTTCCCGGAG
CGCAAGCGCT GGACCCCGGC CGACCTGGAG GTGGTGCTGG TGGAGGCGTT GGCCGCCGTG
CTCGACCAGC TCTCCGACAC CCTGGACCGG GTGGCCGGCG AGGCCTACCT GGAGACCGCC
CGGCGCCCCG AGTCGGTGCG CCGGCTGCTG TTGATGATCG GCTACGACGC GCTGGGGCTG
CGCCGGCGCC AGGGCCTGCC GCCCTTCGAC GGGGAGCACG ACGGCGACCC GATTGCGGCC
ATCGAACGCC TGGAACAGTA CTGGCTGGAC CATCCGGAGG ACATGGAGCG GGACCGCCAG
GAGGGGCCGC GCCAGATCCA CCGCCAGCAC CGCATTGTCA CCACCGCGGA CTTTGTCACC
CGGCTCGAGG CCCACCCGGT GGTGGAGCGC GCCCAGGCGG CCGAGACCTG GAACGGGAGC
TGGTCGCTCA TCCAGGTCGC CGTCATCCCC TGGGCCCGGG TGGGCCTGGA CGCCCCGCAG
GACTACGACG ATGCGCTTTG GACGCGCATC GAGCAATTCC ACGCCGAGCG CGACCTCTAC
CTGCCCGGGC GCGACGGCCG GCCGCCGGTG CGCAGCCTGC TGCGCCACTA CCTGGACGAT
TACCGCATGG TCGGCCAGGA GGTGCAGTTG GTGCCGGCCG AGGAGGTGGG CTTGTCGCTG
TCGCTCTCCA TCCAGGTCGC CCCTCACTAC TTCCAGTCGG AGGTCCGCCG GGCGGTGGAG
CAGGCCCTGG GAACCGGCCC GGGGGGGTTC TTCGAGCCGG GCCGGCTGCG CTTCGGCGAG
GATGTCTGGG CCGGCGACCT GTTCCAGTAC CTGATGGCGC TGGACGGGGT GGAGAACCTC
TGCCTCAACC GCTTCAAGCG CATCGGTACC CGCTTCCCGG ACATGAGTGG GACCGGGCGC
ATCGCCCTCA ACGGCCTGGA GCTGGCCGTC TGCGACAACG AACCCGAACA CCCGGAGCGG
GGCTATTTCC ACCTGCGGCT GCACGGCGGG AGGCGGGGCT GA
 
Protein sequence
MADSPEIELQ FQEGCDDCGR REVRLPPRLP ALGDDFDWDL RDYDGFRLFM LEELAARFPE 
RKRWTPADLE VVLVEALAAV LDQLSDTLDR VAGEAYLETA RRPESVRRLL LMIGYDALGL
RRRQGLPPFD GEHDGDPIAA IERLEQYWLD HPEDMERDRQ EGPRQIHRQH RIVTTADFVT
RLEAHPVVER AQAAETWNGS WSLIQVAVIP WARVGLDAPQ DYDDALWTRI EQFHAERDLY
LPGRDGRPPV RSLLRHYLDD YRMVGQEVQL VPAEEVGLSL SLSIQVAPHY FQSEVRRAVE
QALGTGPGGF FEPGRLRFGE DVWAGDLFQY LMALDGVENL CLNRFKRIGT RFPDMSGTGR
IALNGLELAV CDNEPEHPER GYFHLRLHGG RRG
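
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the amino acid assignments of NCBI translation table 11 (the table named in the record), laid out in the conventional TCAG codon order, and it strips the spaces that separate the 10-base blocks. The helper names `clean`, `gc_percent`, and `translate` are hypothetical and are not part of the source database.

```python
BASES = "TCAG"
# Assumption: amino acid assignments for translation table 11, listed for the
# 64 codons in TCAG order (first base varies slowest, third base fastest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Build the codon -> amino acid lookup with explicit loops.
CODON_TABLE = {}
for i, b1 in enumerate(BASES):
    for j, b2 in enumerate(BASES):
        for k, b3 in enumerate(BASES):
            CODON_TABLE[b1 + b2 + b3] = AMINO_ACIDS[16 * i + 4 * j + k]


def clean(seq):
    """Remove the whitespace between 10-character blocks and uppercase."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Return the percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGGCCGACA GCCCCGAGAT CGAGCTGCAG"
print(translate(first_blocks))  # prints MADSPEIELQ
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to `gc_percent` gives a value that can be compared with the reported GC content.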