Gene Mlg_2325 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_2325 
Symbol 
ID4270580 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp2635515 
End bp2636714 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content65% 
IMG OID638127083 
Productputative polysaccharide biosynthesis protein 
Protein accessionYP_743155 
Protein GI114321472 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3980] Spore coat polysaccharide biosynthesis protein, predicted glycosyltransferase 
TIGRFAM ID[TIGR03590] pseudaminic acid biosynthesis-associated protein PseG 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value0.412888 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGTGG CAGCGCAACA CAGGACAACG ACTGGCGGGA TGCGCGTCGC CTTCCGCGTG 
GACGCCTCCC TGGCGATCGG ATCGGGTCAC GTCATGCGCT GCCTCACCCT CGCCGGGGCA
CTGTGCGAGC AGGGCGCGGA CTGCCATTTC CTCTGCCGCG AGCCGCAAGG TCATCTCAAC
AGTCAGATAG CCGAGCGGGG ATTTGCGGTT CATCGTCTGC CCGCGGTAGA GGATGGCTCG
ATAACGTCGC CCGCCGGTTC GGGCCGTTCA GCGAGCGTGG ATGACGAACC ACCGCAACCG
AAACACGCCG AGTGGCTGCA AACCACCCAG GCGACCGATG CCCGCCAGAG TCTCGAAACG
CTGCGAGAGC TGGCACCGGA CTGGCTGATC GTCGACCACT ACGCACTGGA TGCCCAGTGG
GAGGCGCGGG TTCGAGAGGC CATTCCGGGG ATGCGCGTCA TGGTCATCGA CGATCTGGCC
GACCGCCTGC ACCAGGCCGA CCTGCTGCTG GACCAGAACC TGGGCCGCAA GGCCGAGGAC
TACCGTGACC TCGTCCCGGC CCACTGCCGC CTTCTCGTCG GGCCGAAGTA CGCCCTGTTG
CGCCCGGAAT TCGCCGAATG GCGGGAATGG AGCCTGGAAC GCCGACAGGA GAACGGGCCG
GTCAGGCGGC TGCTGGTCAG CCTCGGCGGC GTGGACAGGG ACAACGTCAC CGGGCAGGTC
CTCGATGCCT TGTCCGAAGT CGAGTTGTCG AAAGAAATGG AAATCACCGT GGTCATGGGC
GCATCCGCCC CTTGGCTTGA AGCGGTTCGG GGCCGCGCCC GGCAGATGCC GTGTTCGACG
GAAGTCGTGG TTAACGTCGA TGACATGGCC CGGCGCATGG CCGAGGCCAA TCTTGCCATC
GGCGCGGCGG GCAGCACGGC GTGGGAGCGC TGCTGTCTTG GCTTGCCGAC CATCGTGCTG
GTGCTGGCGG AGAATCAGCG GGAGATCGCG CGAAGCCTGC ATCGTGCGGG TGTGGCTCAT
TCACTTGGTG CCCCTGATGC ATTGTTCGAT CTGGTTGGCC AATGGCCAAT GATCACCCAG
CCAGAGTACT TGAAAGGCCT GAGCCGGAAG GCCGCAAGCC TGGTGGATGG CCGTGGTGCC
GTCCGTGTGC GGAATGGGCT GATGGGCGTT GAGATGGCGA ACGAGGCGAA CGATGGTTGA
 
Protein sequence
MAVAAQHRTT TGGMRVAFRV DASLAIGSGH VMRCLTLAGA LCEQGADCHF LCREPQGHLN 
SQIAERGFAV HRLPAVEDGS ITSPAGSGRS ASVDDEPPQP KHAEWLQTTQ ATDARQSLET
LRELAPDWLI VDHYALDAQW EARVREAIPG MRVMVIDDLA DRLHQADLLL DQNLGRKAED
YRDLVPAHCR LLVGPKYALL RPEFAEWREW SLERRQENGP VRRLLVSLGG VDRDNVTGQV
LDALSEVELS KEMEITVVMG ASAPWLEAVR GRARQMPCST EVVVNVDDMA RRMAEANLAI
GAAGSTAWER CCLGLPTIVL VLAENQREIA RSLHRAGVAH SLGAPDALFD LVGQWPMITQ
PEYLKGLSRK AASLVDGRGA VRVRNGLMGV EMANEANDG