Gene Mlg_1885 details


Gene Information

Locus tag: Mlg_1885
Symbol:
ID: 4269740
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 2147604
End bp: 2149574
Gene Length: 1971 bp
Protein Length: 656 aa
Translation table: 11
GC content: 71%
IMG OID: 638126641
Product: transglutaminase domain-containing protein
Protein accession: YP_742719
Protein GI: 114321036
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1305] Transglutaminase-like enzymes, putative cysteine proteases
TIGRFAM ID:
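
The length fields in this record can be cross-checked against the coordinates. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the protein length excludes a single terminal stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2147604
END_BP = 2149574


def gene_length(start_bp, end_bp):
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(length_bp):
    """Residue count, assuming one trailing stop codon that is not translated."""
    return length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1971
print(protein_length(length_bp))  # 656
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.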


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.307822
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 38
Fosmid unclonability p-value: 0.724755
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGGAA CAGCGCACCC GCCGACCCCG CTGCCCCGCC AGGCCCAGGC CTGGCTGCTG 
GCCACCCTCG CACTGGTGGC TGCCGCCCAC GCCCCCCACC TGCCCTGGTG GCTTAGCCTG
GGGGCGCTGG CCTGCGGACT GTGGTGCTGG CAAAACACCC GCCAGTGGCG CCGCCTGCCG
CCGCGCAGCC TGCGCTTCGG ACTCACCCTG CTGGCAACCG GCGGGGTCTA CCTGAGCTAC
GGCGAACTCC TGGGCCAGGA GGCGGGAACG GCGTTGCTGC TTGCCATGAC GGGGCTCAAA
CTGCTGGAAC TGCGCAGCCG GCGCGACGCG GTGCTGCTGG TGGGGCTGGG GTTCTTCCTA
GTGGCCACGC AGTTCCTGCG CTCCCAGGAA CTGCCCATGG CCTTTTACCT GGGCGCCTGC
ACCCTGGGGC TGATCATCAG CCTGATGGGG GCGACCCGGG AGAACGCCCC ACCCCACCCG
CTGGCGCATC TGCCTCAGGC GGCCACCTTG CTGCTCCAGG CCCTGCCCTT CATGGTCCTG
CTGTTCTTCC TCTTCCCCCG CCTGGACGGC CCCCTCTGGG CCATGCCGGA GGATGGTGAC
CGGGCCCGGA CCGGGCTCAG CGACCGCATG GAGCCCGGTC AGATCAGCGA ACTGGCCCGC
TCGGACGCCG TGGCCTTCCG GGTGGAGTTC GAGGCCGCGC CGCCCCCCCG ATCCCGGCGT
TACTGGCGCG GCCCGGTGTT CGAGGCGTTC GACGGCCAGC GCTGGCAGCC CGCCGACCAC
CTGCACGAGC GCCGGGAGCC GGTCGAACCA GCGGCGGACG CCGAGCGCAT CGAATACACC
GTGACCCTGG AGCCCCATGG CCAACCCTGG CTGTTCGGCC TGGATCTTCC GCTGACCCTG
GCGGCCGGGC ATGCCAGCGC CCGGGGCGCC CAGACCTGGG AGCGGGACAC GCCGGTCAAC
CGGCGCGTAC GCTACGCGGG CCAAAGCACG CCCGATTACC GTCTGGGCGC CACGCTGGAC
CCGCAGCGCC GTGAGACCAA TCTGGCCCTC GCGCAGAACC GCCACCCGAG AACCGTGGCG
CTGGCCCGCT CCTGGCGGGC GCAGACCGAC GACCCGCAGG CCCTGCTGGG GCAGGCCATC
AGCCATTTCC GTGAGCAGGA TTTCGGCTAC ACCCTGAGCC CACCCCTGAT GCGCGACGAC
ATGGTCGATG AGTTTCTGTT CGACCATCAA CAGGGGTTCT GCGAACACTA CGCGGCCGCC
TTTGCCGTGA TGATGCGTGC CGCCGGAGTA CCCGCCCGCG TAGTCACCGG CTACCTGGGC
GGCGAGATGA ACCCGACCGG CGACTACATG ATCGTGCGCC AGTCCGACGC CCACGCCTGG
AACGAGGTCT GGCTCGAGGG CGAAGGCTGG ATCCGGATCG ACCCCACCGC ACTGGCCGCC
CCGGCACGGC TGGACGAGGG TCTGGCCGGC GCCCTCGGGG ACCCCGCCAG CGCCCCGGCC
CTGGCCCGGC TGGACGCCTC GTGGCTGCGC GACCTGCGCC TGCGCTGGGA TGCGGTGAAC
ATGACCTGGC ACCGCTGGAT GCTCGGTTAC GGCCCGGAGT TGCAGCGCCA GTGGCTGGAA
CGCCTGGGGC TACAGAGCTG GCAACAAGCG GTGATGGCCC TGGGCCTGGC CCTGATCGCC
GCCTCCCTGC TGCTGGCCTG GGTCACGCTT TACCGCAACG CCCCGCCTGC CGCCGATCCG
GCCGTCCGTG CCTGGCAGCG TCTGTGCCGG CGTCTGGCCC GCCGGGGCCT GCCCCCACGG
CCGGGTGAGC CACCCAACTG CTATGCCCGC CGGATTGCCC GGGCGCGCCC GGACCTGGCT
GCCGCCATCT TGCATGTGGC CCGGCTCTAC CAGCAGTATC GGTATGAACC CAACCCCAGC
CAGGCCGACC TGCAGGCCCT CCGCAAATGC ATCAGACGCC TGCGGCCCTG A
 
Protein sequence
MSGTAHPPTP LPRQAQAWLL ATLALVAAAH APHLPWWLSL GALACGLWCW QNTRQWRRLP 
PRSLRFGLTL LATGGVYLSY GELLGQEAGT ALLLAMTGLK LLELRSRRDA VLLVGLGFFL
VATQFLRSQE LPMAFYLGAC TLGLIISLMG ATRENAPPHP LAHLPQAATL LLQALPFMVL
LFFLFPRLDG PLWAMPEDGD RARTGLSDRM EPGQISELAR SDAVAFRVEF EAAPPPRSRR
YWRGPVFEAF DGQRWQPADH LHERREPVEP AADAERIEYT VTLEPHGQPW LFGLDLPLTL
AAGHASARGA QTWERDTPVN RRVRYAGQST PDYRLGATLD PQRRETNLAL AQNRHPRTVA
LARSWRAQTD DPQALLGQAI SHFREQDFGY TLSPPLMRDD MVDEFLFDHQ QGFCEHYAAA
FAVMMRAAGV PARVVTGYLG GEMNPTGDYM IVRQSDAHAW NEVWLEGEGW IRIDPTALAA
PARLDEGLAG ALGDPASAPA LARLDASWLR DLRLRWDAVN MTWHRWMLGY GPELQRQWLE
RLGLQSWQQA VMALGLALIA ASLLLAWVTL YRNAPPAADP AVRAWQRLCR RLARRGLPPR
PGEPPNCYAR RIARARPDLA AAILHVARLY QQYRYEPNPS QADLQALRKC IRRLRP
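
The gene and protein sequences above are printed in blocks of ten characters, so the whitespace has to be removed before any analysis. The following is a minimal sketch, not the database's own tooling: it strips the spacing, computes a GC fraction, and translates codons using the amino acid assignments of NCBI translation table 11. All function names are hypothetical. The sketch assumes the reading frame starts at the first base, and it ignores the alternative start codons that table 11 permits (this gene begins with ATG, so that does not matter here).

```python
BASES = "TCAG"
# Amino acid assignments for NCBI translation table 11,
# with codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text):
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(text.split()).upper()


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna):
    """Translate from the first base, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 21 bases of the gene sequence listed above.
first_bases = clean_sequence("ATGAGCGGAA CAGCGCACCC GCCGACCCCG")
last_bases = clean_sequence("ATCAGACGCC TGCGGCCCTG A")
print(translate(first_bases))  # MSGTAHPPTP
print(translate(last_bases))   # IRRLRP
```

The two printed fragments match the start and end of the protein sequence in this record. Passing the full cleaned gene sequence to `gc_fraction` would give a value to compare against the GC content field.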