Gene Mlg_2370 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_2370 
Symbol 
ID4270709 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp2689132 
End bp2690253 
Gene Length1122 bp 
Protein Length373 aa 
Translation table11 
GC content73% 
IMG OID638127128 
Productcapsular polysaccharide biosynthesis protein-like protein 
Protein accessionYP_743200 
Protein GI114321517 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG4421] Capsular polysaccharide biosynthesis protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0553875 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value0.0113185 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGCCCT GGGTCAGGGC CGTGGATCGG CTCCTGGGCA CCGATTTCAA TAGCCACCCG 
GCCTGCGCCT GGCAGCCCAG CCCGGTGCCG TTGCACGCCA TCGAGCAGGC CGGCGACGGG
GCCGTGGACC AGGCCGTAAA CTGGCGGCAG ACGCCCCCCA TCGCGCCGGT TGGGGTGGGG
CGGGTGCGCC GGCCGTTGAT CCTGCTGCCG AGTCCGGCCC CGCAACTGGC GCATGACCCG
CTGCGGCGCC GCGCCGTCAT CCTCACCAGC GGCGGGCGGC CCGTGCGCTA TCCGAAGTCT
GCCCCGGCCC TGCGCCATCT GCTGCGCGGC GGTTGGCGCT ACAGTGCCCG GCTGGGCCGC
GCCGCCCGCC CGGCTGTCCG GCTGGGCACC GTCGCCGTCC TCGGCAACCA CGATCCCGGC
TGCAACAACT ACTACCACTG GTGGGCGGAC ACCCTCGCCG ACCTCTGGTT TCTGCGCGAG
TCCGGCGTGG ACCTGGGCCG GGTCGACAGC TTCCTGATGG CCTATGGCGG CTACCCCTGG
CAACAACAGT CCCTGGCCCT GTGCGGCATT GACCAGGAGC GGGTGGTGGC CTTTGCCGAC
CACCCCGCGC TGACCGCGGA GCAGGCGCTT GTACCGGTGC GGAGCAGGGG GAGTTGGGTG
TCGCCGGTCT GGCTGGCGAG GGCGCTGCGG GAGCTGACCG GGTGGCGGCC GCCGGCCGTC
ACCACCCCGG GCCGTCGCAT CTACCTGTCG CGGCGCGATG CCCCTCGCCG GCAGGCGGCC
AACGAGGCGG CGGTGGAGCG GCTGCTGGTG GATGAGTCGG GTTTCGAGAG TCACCAGTGC
AGCGGCCTGA GCGTGCCCCG CCAGCAGGCC TTGTTCGCCG ACGCCGAGGT CATCGTGGCG
CCCCACGGTG CGGCGCTCAC CAACCTCGTC TGGTGCCGCC CGGGTACCCG GGTGGTGGAA
CTGGTCCCCG AGGGCCACCG CAACCCCTGC TTCCGTGACC TGGCCGCCCA GTCCGGCCTG
GACTACCGCG CCATCCTCTG TCCGGCAACG GGTGCCGGGG GCGGCCTGAC TGCCGACATC
CAGGTGCCGC TGGCGCGCCT GCGAGAGGCA CTGGCCGGGT GA
 
Protein sequence
MSPWVRAVDR LLGTDFNSHP ACAWQPSPVP LHAIEQAGDG AVDQAVNWRQ TPPIAPVGVG 
RVRRPLILLP SPAPQLAHDP LRRRAVILTS GGRPVRYPKS APALRHLLRG GWRYSARLGR
AARPAVRLGT VAVLGNHDPG CNNYYHWWAD TLADLWFLRE SGVDLGRVDS FLMAYGGYPW
QQQSLALCGI DQERVVAFAD HPALTAEQAL VPVRSRGSWV SPVWLARALR ELTGWRPPAV
TTPGRRIYLS RRDAPRRQAA NEAAVERLLV DESGFESHQC SGLSVPRQQA LFADAEVIVA
PHGAALTNLV WCRPGTRVVE LVPEGHRNPC FRDLAAQSGL DYRAILCPAT GAGGGLTADI
QVPLARLREA LAG