Gene Mlg_2047 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_2047 
Symbol 
ID4270181 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp2318842 
End bp2320443 
Gene Length1602 bp 
Protein Length533 aa 
Translation table11 
GC content72% 
IMG OID638126803 
Productgamma-glutamyltransferase 2 
Protein accessionYP_742879 
Protein GI114321196 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0405] Gamma-glutamyltransferase 
TIGRFAM ID[TIGR00066] gamma-glutamyltranspeptidase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0649366 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value0.471768 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTTGCCA ATGACACGCC GCTGAGGGCG GCGGTGAGTG CACCCCATCA TCTGGCGGCC 
GAGGCGGGCG CCGGCGTGCT GCGGGAAGGG GGGAATGCCA TCGAGGCCAT GGTGGCCGCT
GCCGCAGCGA TTGCGGTGGT CTATCCGCAC ATGAACGGCC TGGGGGGCGA CAGCTTCTGG
TTGTTGCGCG AGCCGGGCAG GGCGCCCCTG GGTATCGAGG CCTGTGGCCC GGCGGCGGTC
GGTGCGACAC CCGCCTGGTA CCGCGACCGG GGGCTGACGG TCATCCCTAG CCGCGGAGGG
GCGGCCGCCA ACACCGTCGC CGGCACGGTG GCCGGCTGGC AGGTGGCGCT GGCGCTTAGT
CGTTCGCAGT GGTCGGGGCG CCTGCCCCTG CACCGGCTCC TGGCGCCCGC CGTGGCGCAT
GCGCGAGAGG GGTACCCGAT GACCCATAGC CAGGCCGAGG CCACCCGGGA CAAGCATCCG
GAGCTGGGGC CGCAGCCGGG GTTTGATGCC CAGTACTTGC CCGGCGGGGC CTTTCCCGTC
CCGGGCGAGA CGTTCCGCCA GCCGGAGCTG GCCAGCACCC TGGAACGCCT GGCGGCGGTC
GGCCTGGCGG ACTTCTATTC CGGCGAGCTG GCACAGGCGC TGGGTGAGGG CCTCGCCGAG
GCGGGCAGTC CGATACGGGC CCCGGACCTG GCGGGCTATG CCGCCCGCCG GGTGGCACCG
CTGAACATGA GGCACAGCCT GGGCACCCTG TGGAACATGC CGCCACCCAC TCAGGGGTTG
GCCTCGTTGA TGATCCTCGG AGTCTTTGAC CGGCTGCAGC GGCGCCACCC GGTAGCGGCG
GAGAGCGCAG AGTGGTTGCA CGCCATGGTG GAGGCAGTCA AGCAGGCCTT CCTGGTGCGG
GACCGCGTTG TCACCGATCC CGCCTATCTG CCGGAGGATC CAGCGCAGTG GCTGAAGCCG
GAGGCGCTGG ACGCCTTGGC GGATCGGGTG GACTGGAGCC GCGCCCTGGC CTGGCCGCAG
CCGGCCTCCC CGGGGGACAC CGTATGGCTG GGGGCCATCG ACGCCGAGGG TCGCTGTGTC
AGCTTCATCC AGAGCCTGTT TCATGAGTTC GGCAGCGGGG TGGTCGTCCC CGACACCGGC
GTGATCTGGC AGAACCGGGG CTGCAGCTTC TCGCTCGCGC CCGATGCCCT GAATGACCTC
AAGCCGGGCC GCCGTCCCTT CCATACCCTG AACCCGGCCC TGGCGGTTCT GGATGACGGC
CGTACCCTGG TATACGGCAC CATGGGGGGG GAGGGCCAAC CGCAGACCCA GGCGGCGGTG
TTCACCCGGG TGGCCCTCTA CGGCCAATCA CCGGAGCAGG CGGTGGCCTC GCCGCGCTGG
CTGTTGGGCA GGACCTGGGG GGCCGGGACG GACACGCTGA AGCTGGAAGC GGATTTCCCG
CCGGAGTTGG TGGAAGCGCT GCGGGGCCGT GGCCACGACG TGGAGGTGGT GCCGCCCCGC
AACAGTGCGA TGGGCCACGC CGGCCTCCTG GTGCGTGACC GGGCGGGCCA TGTCCGGGCG
GCATCCGATC CGCGTAGCGA TGGAGGCGTG GCGGGGCTAT GA
 
Protein sequence
MLANDTPLRA AVSAPHHLAA EAGAGVLREG GNAIEAMVAA AAAIAVVYPH MNGLGGDSFW 
LLREPGRAPL GIEACGPAAV GATPAWYRDR GLTVIPSRGG AAANTVAGTV AGWQVALALS
RSQWSGRLPL HRLLAPAVAH AREGYPMTHS QAEATRDKHP ELGPQPGFDA QYLPGGAFPV
PGETFRQPEL ASTLERLAAV GLADFYSGEL AQALGEGLAE AGSPIRAPDL AGYAARRVAP
LNMRHSLGTL WNMPPPTQGL ASLMILGVFD RLQRRHPVAA ESAEWLHAMV EAVKQAFLVR
DRVVTDPAYL PEDPAQWLKP EALDALADRV DWSRALAWPQ PASPGDTVWL GAIDAEGRCV
SFIQSLFHEF GSGVVVPDTG VIWQNRGCSF SLAPDALNDL KPGRRPFHTL NPALAVLDDG
RTLVYGTMGG EGQPQTQAAV FTRVALYGQS PEQAVASPRW LLGRTWGAGT DTLKLEADFP
PELVEALRGR GHDVEVVPPR NSAMGHAGLL VRDRAGHVRA ASDPRSDGGV AGL