Gene Mlg_1734 details


Gene Information

Locus tag: Mlg_1734
Symbol:
ID: 4270841
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 1983309
End bp: 1984436
Gene Length: 1128 bp
Protein Length: 375 aa
Translation table: 11
GC content: 70%
IMG OID: 638126492
Product: putative capsule biosynthesis protein
Protein accession: YP_742570
Protein GI: 114320887
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG2843] Putative enzyme of poly-gamma-glutamate biosynthesis (capsule formation)
TIGRFAM ID:
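
The length fields in this record follow from the coordinates. The sketch below shows the arithmetic, assuming the start and end positions are 1-based and inclusive, the gene is unspliced (as the record states), and the final codon is a stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
# Coordinates copied from the record (assumed 1-based, inclusive).
START_BP = 1983309
END_BP = 1984436


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of an unspliced feature with inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    n_bp = gene_length(START_BP, END_BP)
    print(n_bp, "bp")
    print(protein_length(n_bp), "aa")
```

With the coordinates listed here, the results agree with the Gene Length and Protein Length fields.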


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.720674
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 38
Fosmid unclonability p-value: 0.681474
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGTCGAT GGAGTCGTCC GGTGGAGCAC AGCGCAACGC TTTTCCTCGC GGGGGATGTG 
ATGACCGGTC GTGGCATCGA CCAGGTGCTG CCGCACTCGG TGGATCCGCG GATTTACGAG
CCGGTGATGG ACTCGGCCGA GGGGTATGTG GCGCTCGCGG AGCGGGCAAA CGGGCCGATA
CCCAGACCGG TGGATTACGC CTATGTCTGG GGCGATGCCC TGGCGGAGCT GCGGGCGTGG
GCGCCGGCGC CGCGCCTGAT CAACCTGGAG ACGGCGGTCA CCACCTGCGA CGACCCCGAG
CCCAAGGGCA TCCATTACCG CATGCACCCG GGTAATGTGC CGCTGCTGCA GGCGGCTGGC
GTGGACGGTT GCGTCCTCGC CAACAACCAC GTGCTGGACT GGGGCCGGGC CGGTTTGCTG
GAGACGCTGG AGGCACTGGA CCGGGCGGGC ATCCGTTATG TCGGGGCCGG TCGGGACCGG
CGGGAGGCGG CGGCGCCGAC GGTATTTGGG GTGCCCGGTG GGCGGCTGGT GCTGTACGCG
CTGGGTATGC CCAGCAGCGG AGTGCCGGCG GCCTGGGCGG CCACCGAGGA GCAGCCGGGC
GTCAATGTGT TGCCCCGGCC CTCGGATGAG GCCGTGACCC TGCTGGCCGA GGCCGTCCGC
CGTGAGCGCC GACCCGGCGA CGTGGTGGCG GTCTCGCTGC ACTGGGGCTC CAACTGGGGC
TACGCCATCC CCCGGGCACA TCGGGTGTTT GCCCACGCGC TGATCGATCG GGCCGGGGTC
GATCTGGTCT GGGGTCACTC TTCCCATCAT GTGCGCGGTA TCGAGGTCTA CAAGGACCGC
CTGATCCTCT ACGGTTGTGG TGATTTCCTC AATGACTACG AGGGGATCGG TGGCCATGAG
GCCTTCCGCG GCGACTTGGT GCTGATGTAC CTCCCAGCAC TGGCGCTCGC CGACGGCCGG
CTGCGGGCCC TTACCCTGGT ACCGCTGCAG ATCCGGAACT TCCGGCTGCA CCGGGCGAAG
CGGGCGGACG CCGAATGGCT GGCCGCGGTG CTGGACCGCG AGGGACGGGC GCTGGGCACC
TCGGTGACCG TGAGCGCGGA CGGTCGCCTG GCGCTGCACT GGCGGTGA
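
The GC content field can be recomputed from the gene sequence. The following is a minimal sketch, assuming GC content means the fraction of G and C among the unambiguous bases; the record does not say how the reported percentage was rounded. The helper name `gc_fraction` is hypothetical.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G/C among A, C, G, T; spaces and line breaks are ignored."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    return sum(b in "GC" for b in bases) / len(bases)


if __name__ == "__main__":
    # First two 10-base blocks of the gene sequence, as a small example.
    print(gc_fraction("ATGGGTCGAT GGAGTCGTCC"))
```

Passing the full gene sequence (with or without the block spacing) gives the value to compare against the GC content field.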
 
Protein sequence
MGRWSRPVEH SATLFLAGDV MTGRGIDQVL PHSVDPRIYE PVMDSAEGYV ALAERANGPI 
PRPVDYAYVW GDALAELRAW APAPRLINLE TAVTTCDDPE PKGIHYRMHP GNVPLLQAAG
VDGCVLANNH VLDWGRAGLL ETLEALDRAG IRYVGAGRDR REAAAPTVFG VPGGRLVLYA
LGMPSSGVPA AWAATEEQPG VNVLPRPSDE AVTLLAEAVR RERRPGDVVA VSLHWGSNWG
YAIPRAHRVF AHALIDRAGV DLVWGHSSHH VRGIEVYKDR LILYGCGDFL NDYEGIGGHE
AFRGDLVLMY LPALALADGR LRALTLVPLQ IRNFRLHRAK RADAEWLAAV LDREGRALGT
SVTVSADGRL ALHWR
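
The protein sequence is the translation of the gene sequence under translation table 11. The sketch below is one way to reproduce it, assuming the codon assignments of table 11 match the standard genetic code (the tables differ only in which codons may act as start codons, which does not matter here because the gene begins with ATG). The names `CODON_TABLE` and `translate` are illustrative.

```python
# Codon table in the conventional T, C, A, G ordering.
# Assumption: table 11 uses the standard amino acid assignments.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(seq: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    dna = "".join(ch for ch in seq.upper() if ch.isalpha())
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # X marks ambiguous codons
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    first_line = (
        "ATGGGTCGAT GGAGTCGTCC GGTGGAGCAC "
        "AGCGCAACGC TTTTCCTCGC GGGGGATGTG"
    )
    print(translate(first_line))
```

Translating the first line of the gene sequence yields the first 20 residues of the protein sequence shown in this record.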