Gene Mlg_0802 details


Gene Information

Locus tag: Mlg_0802
Symbol:
ID: 4270566
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 897885
End bp: 899915
Gene Length: 2031 bp
Protein Length: 676 aa
Translation table: 11
GC content: 68%
IMG OID: 638125553
Product: polysaccharide biosynthesis protein CapD
Protein accession: YP_741646
Protein GI: 114319963
COG category: [G] Carbohydrate transport and metabolism; [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1086] Predicted nucleoside-diphosphate sugar epimerases
TIGRFAM ID:
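
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the reported protein length excludes the stop codon; the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 897885
end_bp = 899915

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the final codon is assumed to be a stop codon
# that is not counted in the protein length.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the result agrees with the table: 2031 bp, no leftover bases, and 676 amino acids.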


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 44
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATAAAG CGTTCCTCAT CGATCTGCCA CGCCCGGTGA AGCGGTTCGT CATGATCATG 
GCGGACACGC TCATGCTGCC CGTGGCCCTG TGGGCGGCCT TCAGCCTGCG CCTGGGGACC
CCCGTCCCCG GTGTGCTGCT GGACTACGGG TGGCTGCTGC TGCTCGTGCC GGCGGTCAGC
CTGCCGGTTT TCGCCCTGTT CGGCCTTTAC CGCGCGGTGG TCCGTTACAT GGGGCTGCAG
GCGATCATTG CGGTGGTGCA GGGGGTGACC CTGTCCGCAC TGGTCTTCGG CGCCCTGGTT
ATGCTGGGCC GGCTGGAGGG TATTCCCCGC TCCGCTCTGC TGATCTACTG GCTGCTGGCG
CTGTTCATGG TGGGCGGGTC GCGCCTGGTG GTGCGCGCTT GGTTCCAGGC GGCGATCAAA
CGCCGGGGGA CCGAGAAGCC GGTGGTCATC TACGGCGCGG GGACCGGGGG GATTGGCCTG
GCCACCAGCC TGTTCAATGG CCGGCAGTAC CGCCCGGTGG CCTTCGTGGA TGACAACCCG
GCAAAACAGG GCACGGTGAT CGCCGGGCTG CCCGTGCGCA AACCGGCCGA GCTGGCAGAA
CTGATCGACC GCAACCGGCT GGAGTATATC CTGCTGGCCA TGCCGCGGCT GTCGCGGACC
CGCCGCCGGG AGATCGTCGC CCAGTTGGAG CCGCTGCCCG CCCACATCCT CACCATCCCC
AGCCTGGCGG ACATCGTGGC CAACCGGGCC AGTCCCGACG AGGTGCGGGA GGTGGAGGTG
GAGGACCTGC TCGGGCGTGA TGCCGTGGGC CCGCGGCGGG AGCTGCTCGC CCGCTGCATC
CGCGGTAAGA CGGTCATGGT CACCGGCGCC GGGGGCTCCA TCGGCTCCGA GCTTTGCCGG
CAGATCCTGC GCGAGCAGCC CACTCAGCTG GTGTTGGTCG AGCGCTCCGA GTTCGCCCTC
TACGCCATCG AGCGCGAGCT CCAGGCTCAG TTGCAGGGGG AGGCCCGGCG CCCCGCCGTC
CAGGCCGTCC TGGCCAATGT CACCGACCTG GCGCGCATGG AAATGCTGAT GCACGCCTTC
CGCGTCGACA CGGTCTATCA CGCCGCTGCC TACAAGCACG TCCCGCTGGT GGAGACCAAC
GTGCTGGAGG GCATCGATAA CAACGTCTTC GGCACCCTGC ACACGGCCCT GGCGGCGGTG
GAGGCCGGGG TCCGGTACTT CGTGCTAGTG AGCACCGACA AGGCCGTCCG CCCCACCAAT
GTCATGGGGG CGAGCAAGCG CCTGGCCGAA CTGGTGCTGC AGGGGTTGGC CCGTCAGCCC
GATATCCGCA CCCGCTTCTC CATGGTCCGC TTCGGTAATG TGCTCGGCTC CTCCGGCTCG
GTGGTGCCGC TGTTCCGCGA GCAGATCCGC AACGGCGGCC CGATCACGGT CACCCACCCC
GAGGTCACCC GCTATTTCAT GACCATCCCC GAGGCCGCCT CGCTGGTCCT GCAGGCGGGA
TCCATGGCCC ACGGCGGCGA GGTCTTCGTG CTGGACATGG GCGAGCCGGT GCGCATCGTC
GACCTGGCCC GGCGCATGAT CCGTCTGTCC GGTCTCGAGG AGCGAAACGC GGAACACCCC
GATGGCGACA TCGAGATCCA TTTCACCGGC CTGCGCCCCG GCGAGAAGTT GTATGAGGAG
CTGCTGCTGG GTGAGGCGGT CACCGAGACC AGCCACCCCA TGATCATGCG CGCCCGCGAG
GGGCACTTGC CCACCGATAC CCTGCAAAGC CTGCTAAGCG AGCTGCGCGC GGCCTGCCGG
CGCTACAACA CCCAGGACGC CCGCAAGCTG ATGGCCCAGG TGGTGGAGGG TTACGAGGCC
GCCGGACCCA ACTGCGACGT GCTCGGGCGG CAGCTGAACG AGTCGGCCGA CGCCGTCGCC
CGAGGCCTGT TCAGGGACGG CGAAGGCCCC TGCCCCATCC GCCGGGCCTA CCAGCGTGGC
ACCACGCCAA TAAGCGCCCC CCAATCAGAA GCCCCATTGG CTAAACGCTG A
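
The gene sequence above is laid out in space-separated groups of ten bases, so it has to be joined before any analysis. The following is a minimal sketch of that step together with a GC-fraction calculation; the helper names `parse_sequence_block` and `gc_fraction` are hypothetical, and only the first line of the sequence is used to keep the example short.

```python
def parse_sequence_block(block: str) -> str:
    """Join whitespace-separated sequence groups into one uppercase string."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence as shown in this record.
first_line = "ATGAATAAAG CGTTCCTCAT CGATCTGCCA CGCCCGGTGA AGCGGTTCGT CATGATCATG"
dna = parse_sequence_block(first_line)

print(len(dna), round(gc_fraction(dna), 3))
```

Passing the full gene sequence block to the same functions would give the whole-gene value to compare against the GC content listed in the Gene Information table.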
 
Protein sequence
MNKAFLIDLP RPVKRFVMIM ADTLMLPVAL WAAFSLRLGT PVPGVLLDYG WLLLLVPAVS 
LPVFALFGLY RAVVRYMGLQ AIIAVVQGVT LSALVFGALV MLGRLEGIPR SALLIYWLLA
LFMVGGSRLV VRAWFQAAIK RRGTEKPVVI YGAGTGGIGL ATSLFNGRQY RPVAFVDDNP
AKQGTVIAGL PVRKPAELAE LIDRNRLEYI LLAMPRLSRT RRREIVAQLE PLPAHILTIP
SLADIVANRA SPDEVREVEV EDLLGRDAVG PRRELLARCI RGKTVMVTGA GGSIGSELCR
QILREQPTQL VLVERSEFAL YAIERELQAQ LQGEARRPAV QAVLANVTDL ARMEMLMHAF
RVDTVYHAAA YKHVPLVETN VLEGIDNNVF GTLHTALAAV EAGVRYFVLV STDKAVRPTN
VMGASKRLAE LVLQGLARQP DIRTRFSMVR FGNVLGSSGS VVPLFREQIR NGGPITVTHP
EVTRYFMTIP EAASLVLQAG SMAHGGEVFV LDMGEPVRIV DLARRMIRLS GLEERNAEHP
DGDIEIHFTG LRPGEKLYEE LLLGEAVTET SHPMIMRARE GHLPTDTLQS LLSELRAACR
RYNTQDARKL MAQVVEGYEA AGPNCDVLGR QLNESADAVA RGLFRDGEGP CPIRRAYQRG
TTPISAPQSE APLAKR
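
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below uses only the standard library and builds the codon table by hand; translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons. Alternative start codons are not handled here, which is harmless for this gene because it begins with ATG. The function name `translate` is illustrative.

```python
from itertools import product

# Codon-to-amino-acid assignments in TCAG order ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from this record (60 bases, 20 codons).
first_line = "ATGAATAAAG CGTTCCTCAT CGATCTGCCA CGCCCGGTGA AGCGGTTCGT CATGATCATG"
print(translate(first_line))
```

The 20 residues printed for the first line correspond to the first two groups of the protein sequence above (MNKAFLIDLP RPVKRFVMIM).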