Gene Mlg_0961 details

Gene Information

Locus tag: Mlg_0961
Symbol:
ID: 4270431
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 1098301
End bp: 1099776
Gene Length: 1476 bp
Protein Length: 491 aa
Translation table: 11
GC content: 71%
IMG OID: 638125712
Product: 4-alpha-glucanotransferase
Protein accession: YP_741804
Protein GI: 114320121
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1640] 4-alpha-glucanotransferase
TIGRFAM ID: [TIGR00217] 4-alpha-glucanotransferase
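
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive (the convention that reproduces the listed 1476 bp) and that the final codon of the CDS is a stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(1098301, 1099776)
print(length)                     # 1476
print(protein_length_aa(length))  # 491
```

Both results match the Gene Length and Protein Length fields of this record.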


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value: 0.143538
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGGGG GTGGATTCTG GCAGCGGCGC CGGGCCGGGG TTCTCGCCCA CCTGAGTTCG 
CTCCCCGGAG AGGCGGGTAG CGGCAGCCTC GGCCGCCACG CCCACGGCTT CATTGATTGG
CTGGCCGATG CCGGCTTCTC GGTCTGGCAG ATGCTGCCCC TGGGGCCGAC CCACGACGAC
CTCTGCCCCT ACCAAACCCT CTCGGTCCAC GCCGGTGACG CACGGTTTAT CGATCAAGAG
GCGCTGGAGG CGGTCGGCTG GCTACCGCGG GAACCGCAGC CTTCCCACGC CCCCCGCCGT
TGGCGGCAGG AGCGGTTGTG CCGGGCGCGG GCCGGTCGGG CCGCCGCCGG CGATGAGGCG
TCGATGGCCG AGGAGGCCGC CTTCCGCCAA CGCCACGCCC ACTGGCTGGA GGACTATGCG
CTCTACGTCG CATTGCGCCG GGAGAAGGGG GAGGCCCCTT GGTGGGAGTG GCCCACCGCC
GAGCGGGACC GGGAGGAGGC TGCCCTGGCG GCGGCCCGGG AGCGGCTGGC CGACGCCATC
GACCAGGCCG TCTTCGAGCA ATTCCTCTTC TTCAGCCAGT GGGAGGCACT GCGCCGGCAC
GCCGGCGAGC GCGGCGTCGC CCTTTTCGGC GACATGCCCC TTTTCGTCGC CCACGACAGT
GCCGACGTCT GGGCCCACCG GGACTACTTC CAACTGGACG AGGCCGGCCG CCCCCGGACC
GTGGCGGGCG TGCCGCCGGA CTACTTCTCC GATACCGGGC AACGGTGGGG CAACCCCCAC
TATGAGTGGG CGCGGATGCA GGCCGACGGC TTCCACTGGT GGCTGGATCG GCTCGCCACC
CAGTTGGAGC TGTTCGACTT CGTCCGCCTC GACCACTTCC GAGGCCTCGC CGCCTACTGG
TCGATCCCCG CCGAGGCGGA GACCGCCCGG AACGGCCAGT GGGTCCCGGC TCCCGGCCGC
GCCTTCCTGG CCGCGGTGGC GGACCGGTTC GGGCGGGTGC CGCTGGTGGC CGAAGACCTC
GGCTTCATCA CCGAGGACGT GGAGGCGCTG CGGGACACCT TCGGGCTGCC GGGGATGAAG
GTCCTCCACT TTGCCTTCGA CAGCGACGCC GACAACCCCT ACCTGCCACA CCACCACGTC
CGCGACGGCG CCGTCTACAC CGGCACCCAC GACAACGACA CCACCGTCGG GTGGTACCAG
GGGCTCGATC CCGTTGTGGC GGAGCGGGTC GCCGCCTACC TGGGGTACCC CGGCGAAGAG
ATGCCCTGGC CGCTGATCCG CGCCGCCCTG GCCTCGGTGG CCGGCCTGGC CGTGGTGCCG
ATGCAAGATC TGCTGGCGCT GGACTCCGAC CACCGCATGA ACATCCCCGG CGTCGCTGGC
GGCGACAACT GGCGGTGGCG ATTCCAGTGG GAGTGGCTGC CGGAGGGATT GCAGCAGCGC
ATGGCGGAGA TGAACCGGCT TTACGGGCGG GGCTGA
 
Protein sequence
MSGGGFWQRR RAGVLAHLSS LPGEAGSGSL GRHAHGFIDW LADAGFSVWQ MLPLGPTHDD 
LCPYQTLSVH AGDARFIDQE ALEAVGWLPR EPQPSHAPRR WRQERLCRAR AGRAAAGDEA
SMAEEAAFRQ RHAHWLEDYA LYVALRREKG EAPWWEWPTA ERDREEAALA AARERLADAI
DQAVFEQFLF FSQWEALRRH AGERGVALFG DMPLFVAHDS ADVWAHRDYF QLDEAGRPRT
VAGVPPDYFS DTGQRWGNPH YEWARMQADG FHWWLDRLAT QLELFDFVRL DHFRGLAAYW
SIPAEAETAR NGQWVPAPGR AFLAAVADRF GRVPLVAEDL GFITEDVEAL RDTFGLPGMK
VLHFAFDSDA DNPYLPHHHV RDGAVYTGTH DNDTTVGWYQ GLDPVVAERV AAYLGYPGEE
MPWPLIRAAL ASVAGLAVVP MQDLLALDSD HRMNIPGVAG GDNWRWRFQW EWLPEGLQQR
MAEMNRLYGR G