Gene Mlg_1188 details


Gene Information

Locus tag: Mlg_1188
Symbol:
ID: 4270323
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 1388288
End bp: 1390015
Gene Length: 1728 bp
Protein Length: 575 aa
Translation table: 11
GC content: 72%
IMG OID: 638125937
Product: endonuclease/exonuclease/phosphatase
Protein accession: YP_742027
Protein GI: 114320344
COG category: [R] General function prediction only
COG ID: [COG2374] Predicted extracellular nuclease
TIGRFAM ID:
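
The start and end coordinates, gene length, and protein length listed above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a coding sequence that ends in a single stop codon; the function names are hypothetical and do not belong to any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Amino acid count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


# Values taken from the record above.
length = gene_length_bp(1388288, 1390015)
print(length)                           # 1728
print(expected_protein_length(length))  # 575
```

Both results agree with the Gene Length and Protein Length fields of this record.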


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.219404
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value: 0.0718602
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAGGCA AAGGGATTTC GATTCCGAGG CGATGGTTGT GGGCCGCGGC GGCCCTGCTG 
GTGCTACTGG TGCTTGTGCG CCACCTGCCA CCGGCGGCCG GTGCCTGCAG CGGCGATTTC
ACACCGATTT ACCAGGTCAC CGGGGACGAA CCGGGCGAGG CCCTCGATCC GGGGACGTCC
GTCCGCGTGA AGGGCGCGGT TACGGGGGTC TTTCTGGACG ACGGGGGCCT CGACGGCTTC
TTCATCCAAG GCGAGGGCCC CGGCGACGGC CTGCCCAGCG GGGTGTTTGT CTATGCGCCG
GGGCTGGCGC CGGAGGAGAT GGCGCGGGTC CGTGCCGGCC GCCAGCTGGC CCTGCGGGCG
CGCACCGGGC GCTGGCAGGG GCAGATCCAG CTTCAGCGCG TGCGGGCGTT AAACGACTGC
GGCGAGGCGG CGGAGTTGGC ACCGCAGCCC GTTCGCTTCC CGCTGCATCG CCCGGAGCGG
CGCTGGGCCG GCTTGTGGGT GCGGGTGGAG ACCCCGATGA CCGTCAGCGG CAGCCACGAA
CTGCAGCGTT ACGGCAGCCT GCACCTGGCG GCCGACGGGC GTGCCTTCCG GCCCAGCAAC
TTTCTCGACC CCGATGAGCG TCCGCCCGGC CGGCTGCGTT TGATCCTTGA TGACGGTAGT
CATTCGGTCT GGCCGGAGCC GGTGCCCTGG CTGGATGAGC GGGGGACCCG ACGGGTCGGG
ACCCGGGTGG AGGGCCTGGA GGGGATATTG GCCGACACCT TCGGTGCCCT GCGACTGCAC
CCCACCCGCA CGCCCCGGTT CGTCGACCAG AACCCGCGTC CAGCCCCCCC GGAACGGGCC
GGGGAGGGGC TGATCCGGGT GGCCGGTTTT AACGTGGAGA ACTACTTCCT GACCCTCGGC
GAGCGGGGGG CGGACAGTGC GCGGGCCCTG GATCGCCAGC GCGCCCGCCT GCTTCCGGCG
CTCGCGGCCC TGGACGCGGA CATTGTCGGC CTGGTGGAGA TGGAGAACGA CCGCGCCGCG
CTCGAGGACC TGGTTGCGGC GCTGAATGAT CACCTGGGCG CCGACCGGTA CCGGGCGGCG
CCCGGCACAC CGGACACCGG CAGCGACGAG ATCAAGGTCT CGCTGATCTA CCGCCCTGAC
CGGGTCGAAC GGGTGGGCGG ACCGCTCCGT GACCTGGAGC CCGTCCATCA CCGGCCACCG
GTCAAGGCGG CTTTCCGGCC GGCGGCGGGG GGCGCGCCCT TTGCCGTTGC CGTGGTCCAC
CACAAGGCCA AGGTGGGCTG CCCCGACAGC GACGACATCG ACCGCGGCCA GGGCTGCTGG
AACCTGCGCC GCCAGGCGCA GTCGGAAGCC CTGCTCGAGG CCATTGGCCG CTGGCGTGAG
GATCGCGCGG ACGACCTCCC CGTGCTCATC GTCGGCGACG TGAACGCCTA TGGCGGTGAG
GACCCGGTGC GCGCACTGCT GGCCGGTGGC AAGCGCGACC TGCTGGCGCG CCACCTGCCG
CCGGAACGGC GCTACACTTA CGTCTTTCGC GGCGAGTCCG GCTACCTGGA TCACGCCCTG
GCCCCGCCAC GGCTTGCAGA CCGGGTGCAG GCCGCCGGCA CCTGGGCCAT CAACGCCGAT
GAACCGCGGC TGCTGGAGTA CGATGCGCGT GGTATCGAGC GGCGCTTCCG GCCCGGGCCC
TGGCGCAGCT CGGATCACGA CCCGGTTTGG GTGGACCTGC GGCCCTGA
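
The gene sequence above is laid out in space-separated blocks of 10 bases. As a minimal sketch, the blocks can be joined and the GC fraction computed as shown below. The helper names are hypothetical, and only the first line of the sequence is used here to keep the example short; passing the full sequence text would give the figure to compare against the GC content field of this record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence from this record (60 bases).
first_line = (
    "ATGGCAGGCA AAGGGATTTC GATTCCGAGG CGATGGTTGT GGGCCGCGGC GGCCCTGCTG"
)
seq = clean_sequence(first_line)
print(len(seq))
print(f"{gc_fraction(seq):.0%}")
```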
 
Protein sequence
MAGKGISIPR RWLWAAAALL VLLVLVRHLP PAAGACSGDF TPIYQVTGDE PGEALDPGTS 
VRVKGAVTGV FLDDGGLDGF FIQGEGPGDG LPSGVFVYAP GLAPEEMARV RAGRQLALRA
RTGRWQGQIQ LQRVRALNDC GEAAELAPQP VRFPLHRPER RWAGLWVRVE TPMTVSGSHE
LQRYGSLHLA ADGRAFRPSN FLDPDERPPG RLRLILDDGS HSVWPEPVPW LDERGTRRVG
TRVEGLEGIL ADTFGALRLH PTRTPRFVDQ NPRPAPPERA GEGLIRVAGF NVENYFLTLG
ERGADSARAL DRQRARLLPA LAALDADIVG LVEMENDRAA LEDLVAALND HLGADRYRAA
PGTPDTGSDE IKVSLIYRPD RVERVGGPLR DLEPVHHRPP VKAAFRPAAG GAPFAVAVVH
HKAKVGCPDS DDIDRGQGCW NLRRQAQSEA LLEAIGRWRE DRADDLPVLI VGDVNAYGGE
DPVRALLAGG KRDLLARHLP PERRYTYVFR GESGYLDHAL APPRLADRVQ AAGTWAINAD
EPRLLEYDAR GIERRFRPGP WRSSDHDPVW VDLRP
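
The protein sequence can be related back to the gene sequence by translating codons. The sketch below builds a codon table from the amino acid assignments of NCBI translation table 11, which are the same as those of the standard code; alternative start codons are not modeled, which is a simplification. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Amino acid assignments in T, C, A, G codon order (table 11, same as standard).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon; '*' marks a stop codon."""
    dna = "".join(dna.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# First 30 bases of the gene sequence in this record.
print(translate("ATGGCAGGCA AAGGGATTTC GATTCCGAGG"))  # MAGKGISIPR
# Last 12 bases, ending in the stop codon TGA.
print(translate("CTGCGGCCCTGA"))                      # LRP*
```

The two outputs match the first ten and last three residues of the protein sequence listed above.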