Gene Mlg_0934 details

Gene Information

Locus tag: Mlg_0934
Symbol:
ID: 4268221
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 1058971
End bp: 1060140
Gene Length: 1170 bp
Protein Length: 389 aa
Translation table: 11
GC content: 69%
IMG OID: 638125686
Product: tetratricopeptide repeat protein
Protein accession: YP_741778
Protein GI: 114320095
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2956] Predicted N-acetylglucosaminyl transferase
TIGRFAM ID:
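
The gene and protein lengths listed above follow from the start and end coordinates. The short Python sketch below reproduces that arithmetic. It assumes the coordinates are 1-based and inclusive (which is consistent with the stated 1170 bp) and that the CDS ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the record above.
START_BP = 1058971
END_BP = 1060140


def gene_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residues encoded by a complete CDS, excluding the final stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints: 1170 389
```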


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 46
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCGAGT TGCTTTGGCT TTTGCTGCCC GTGGCGGCGA TGTCCGGGTG GTTGGCCGGG 
AGACGGAGCG GGGCCGGGCA TCGGGGCGGC GAGCAACGGG ACCTGCCCGA GGCCTATTTC
CAGGGCCTCA ACTACCTCCT GAACGAGGAG CGCGACAAGG CGCTCGAAGT GTTCACCCAA
ATGGTGGAGG TGGACAGCGA GACAGTCGAG ACCCACCTGG CGCTGGGCAG CCTGTTCCGG
CGCCGGGGTG AGGTCGACCG CGCCATCCGT ATTCACCAGA ACCTCATCGC CCGCCCGGCC
CTGAGCCGCC AGCAGCGCAC CTACGCCCTG CTGGAGCTGG GCGAGGACTA CATGCGTGCG
GGCCTGCTCG ACCGGGCCGA GACGCTCTTC GAGGAGGTGA TCGACCTCAA CCACCACGTC
GAACCGGCCC TGCGTCAACT GCTGGCCATC TATCAGCAGG AGAAGGAGTG GGACCAGGCC
ATCGGCGCCG CCCTGCGCCT GGAAAAGGTC TCCGCCCAGA ACCTCCACCC CCAGGTGGCG
CACTTCTACT GCGAGATGGC GGGCGAAGCC TGGGCGGCCG GCGATCTCAG CCGTGCCCGC
ACCCTCTACA AGCGCGCCCT CACCCACGAC CCGCGGTGTG TCCGGGCGAG TATCCAGGCC
GGGCATCTGG CGCGGCAGAT GGGGCATGCC CGGCAGGCGG TACGTTTGTA CCGGCAGGTG
CCTACCCAGG CACCGGAGTT CGTCGGCGAG GTGCTCGATG GCCTGTACCA GGCGCTGGAG
AGCCTGGGTC AGCTCCACCG CTACCCGGAG TTTCTCGATC AATTGCTCGC CACCGGCAAG
GCCCCGGTGG CGGTGGCGCT GGCGAAAGTG GAGTGGCTGC GCGCGGAGGC TGGGCACGAG
GCGGCGATGC GCTGGCTGGC CGAGCACCTT GAAGCCCAGC CCTCGGTGCG CGGCCTACTC
CGGCTGGTGG AGATGAGCGA CGGCGCCCCC CCTGTGGCGG AGGGTCCGGT GGAGGCGGCA
CTGCACCGGA CTCTCCGCGC GCTGCTGGAG GCGCGGGCGC AGTACCTTTG CGGGCAATGC
GGCTTCACCG CCCGCACGCT GTTCTGGCAA TGCCCCGGCT GCAAGAGCTG GGGCAGTATC
CGCCCCCTGC GTGGCGTGGA GGGAGAGTAA
 
Protein sequence
MPELLWLLLP VAAMSGWLAG RRSGAGHRGG EQRDLPEAYF QGLNYLLNEE RDKALEVFTQ 
MVEVDSETVE THLALGSLFR RRGEVDRAIR IHQNLIARPA LSRQQRTYAL LELGEDYMRA
GLLDRAETLF EEVIDLNHHV EPALRQLLAI YQQEKEWDQA IGAALRLEKV SAQNLHPQVA
HFYCEMAGEA WAAGDLSRAR TLYKRALTHD PRCVRASIQA GHLARQMGHA RQAVRLYRQV
PTQAPEFVGE VLDGLYQALE SLGQLHRYPE FLDQLLATGK APVAVALAKV EWLRAEAGHE
AAMRWLAEHL EAQPSVRGLL RLVEMSDGAP PVAEGPVEAA LHRTLRALLE ARAQYLCGQC
GFTARTLFWQ CPGCKSWGSI RPLRGVEGE
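
The sequences above are printed in space-separated blocks of ten characters. As a minimal sketch, the Python below strips that layout, computes a GC percentage, and translates DNA to protein. It relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons, which this sketch does not model). The helper names are hypothetical, and only the first 60 bases of the gene sequence are included to keep the example short.

```python
# Codon table built in TCAG order; amino-acid assignments are those shared by
# NCBI translation tables 1 and 11. Alternative start codons are not handled.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# First line of the gene sequence, copied with its block spacing.
FIRST_LINE = "ATGCCCGAGT TGCTTTGGCT TTTGCTGCCC GTGGCGGCGA TGTCCGGGTG GTTGGCCGGG"


def clean(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(blocked.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in an unblocked DNA string."""
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate in frame from position 0, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


print(translate(clean(FIRST_LINE)))  # prints: MPELLWLLLPVAAMSGWLAG
```

The printed residues match the first 20 residues of the protein sequence listed above. Passing the full cleaned gene sequence to `gc_percent` and `translate` would allow the same kind of check against the GC content and protein length given in the Gene Information section.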