Gene Mlg_1727 details


Gene Information

Locus tag: Mlg_1727
Symbol:
ID: 4268976
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 1976020
End bp: 1977741
Gene Length: 1722 bp
Protein Length: 573 aa
Translation table: 11
GC content: 69%
IMG OID: 638126485
Product: peptidase M14, carboxypeptidase A
Protein accession: YP_742563
Protein GI: 114320880
COG category:
COG ID:
TIGRFAM ID:
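
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes one terminal stop codon; both assumptions are consistent with the values listed here but are not stated by the record itself. The helper names `gene_length` and `protein_length` are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop: bool = True) -> int:
    """Number of residues encoded by an unspliced CDS of the given length.

    Assumes the CDS is a whole number of codons; if has_stop is True,
    the final codon is a stop and is not counted as a residue.
    """
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1 if has_stop else codons


length = gene_length(1976020, 1977741)
print(length)                  # 1722
print(protein_length(length))  # 573
```

Both results agree with the Gene Length and Protein Length fields above.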


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 40
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGTGCAC CCACCGACAT CGAGCCCTTC CGGCACCAAT ACCTGGATTA CGACACCCTC 
ACCGGGCAGT TGCAGCACTG GGCGTCGGCG CACCCGGAGG TGGCCCGCCT GGAGAGCCTG
GGCACCAGCC CGGAGGGCCG CGAGATCTGG CTGCTCACCG TGGGCAGGCG GCCGGAGCGG
TCGCGTCCCG CCGTCTGGGT GAACGGCAAC ATGCACGGTT CGGAGTTGGC CGGCTCCAGC
GTGGCGCTGG CCGTGGCCGA GGCGGCCCTG CAACTCCACC TGAGCGGTGC CAACACCCAC
GGCGGACTGC TGCCCCATCT GGAGGAACAA CTCCGCGGCG TGCTCTTCTA CATCTGCCCG
CGCGTCTCGC CCGACGGTGC GGAACAGGTG CTTCATCACG GCGGCTTCGT GCGCTCGGCA
CCCCGGCGCA GCCCCCACGC CCCCGACACC CCCCGTTGGG AACCGAGCGA CTTGGACGGC
GATGGCCGCT GCCGCTATCT GCGGATGGAG GATCCGGCCG GGCCGTTCGT CGCCTCGCCC
CGGCATGCCG GGCTGATGCT GCCCCGTGAA CTGGACGACC CACCCCCCTA TTACCGCCTC
TACCCGGAGG GGCTCATCCG TCACTGGGAT GGCCACACCG TGCCGGAGCC GGAACCGTTG
CGAGACACCC CCGATTTCAA CCGCAACTTT CCCTGGAACT GGCGGCCAGA GCCGGACCAG
ACCGGCGCTG GGCACTTTCC AGGCTCCGAG CCCGAGACCC ACGCGGTGCT CGATTTCGCC
ACCCGCCATC CCAACATCTA CGCCTGGCTC GATCTACACA CCTTCGGCGG GGTCTTCATC
CGCCCGCTGA CCGGCGCCCC GGACGCCGCC ATGGACCAGG ATGACCTGGC GCTCTACCGC
CAATTGGCCG CCTGGGGCGA AATGCTCACC GGCTATCCCA CGGTCAGCGG CTTCGAGGAG
TTCACCTACG AACCCGAGAC CCCGCTTTAC GGGGACCTGA CCGACTTCGC CTATCACCAA
CGCGCCTGCC TGGCCCAGGT CTGTGAACTC TGGGACCTCT TCCGCCGGCT CGACCTGCCC
CGCCCGAAAC GCTTCGTGGA CCTCTACACC AGCCTGCACC GTGGCGACAT GGAACGGCTG
GCCCAGTGGG ATGCCGAGCA CAACCGCCAA CGGCTCTTCC GGCCCTGGTT GCCCCTCAAG
CACCCGCAAA TCGGGCCGGT AGAGGTTGGC GGGCTGGACC CCAGCATCGG CATCTGGAAC
CCGCCCCCCG AAGCGCTGCC CGACATCTGT GACGGCATCG CCACCTATTG GTTGCGGGCT
GCCGCCCTGC TACCCCGACT GACTATCGCC GGGCTGGAAT GCCGCCCGCT GGGGGACGAC
CACTGGGAGA TCATCGCGGT CGTGGCGAAT CACGGCTACC TGCCCACCTA CGGCGTAGCC
GCCGGCCGCA GTCGGCCCTG GAACGACGGT GTGGAAACCG AACTCTGGCT GGAGGGCTGT
ACCCTGACCG AGGGCCAGCC GGCCCGCCAA GCCCTTGGCC ATCTCGACGG CTGGGGCCGC
GGATTGGGGA ACATGGCGCA CATGCCCTGG TTCCAGCGCT CACGGGGCAG CAGCCATCAG
GCCCGGGCAC GCTGGGTGGT GCGGGGCCGG GGCACAGTCA CGCTGAGCGT CCGAAGCACC
CGGCTCGGGA CCCTGGCACA GACCCGGCGA CTCACCCCTT GA
 
Protein sequence
MGAPTDIEPF RHQYLDYDTL TGQLQHWASA HPEVARLESL GTSPEGREIW LLTVGRRPER 
SRPAVWVNGN MHGSELAGSS VALAVAEAAL QLHLSGANTH GGLLPHLEEQ LRGVLFYICP
RVSPDGAEQV LHHGGFVRSA PRRSPHAPDT PRWEPSDLDG DGRCRYLRME DPAGPFVASP
RHAGLMLPRE LDDPPPYYRL YPEGLIRHWD GHTVPEPEPL RDTPDFNRNF PWNWRPEPDQ
TGAGHFPGSE PETHAVLDFA TRHPNIYAWL DLHTFGGVFI RPLTGAPDAA MDQDDLALYR
QLAAWGEMLT GYPTVSGFEE FTYEPETPLY GDLTDFAYHQ RACLAQVCEL WDLFRRLDLP
RPKRFVDLYT SLHRGDMERL AQWDAEHNRQ RLFRPWLPLK HPQIGPVEVG GLDPSIGIWN
PPPEALPDIC DGIATYWLRA AALLPRLTIA GLECRPLGDD HWEIIAVVAN HGYLPTYGVA
AGRSRPWNDG VETELWLEGC TLTEGQPARQ ALGHLDGWGR GLGNMAHMPW FQRSRGSSHQ
ARARWVVRGR GTVTLSVRST RLGTLAQTRR LTP
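
The sequences above are laid out in blocks of 10 characters, so they need to be joined before any analysis. The following is a minimal sketch of how the gene sequence could be cleaned up, its GC fraction computed, and its translation compared against the listed protein. The names `join_blocks`, `gc_fraction`, and `translate` are hypothetical helpers, not part of any database tooling. The codon table is the standard one, which is assumed to be sufficient here because translation table 11 uses the same amino acid assignments as the standard code and differs only in its permitted start codons.

```python
BASES = "TCAG"
# Amino acids in NCBI order: first base varies slowest, third base fastest.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str, to_stop: bool = True) -> str:
    """Translate complete codons in frame 0.

    Stops at the first stop codon when to_stop is True; otherwise stop
    codons appear as '*'. Raises KeyError on codons with non-ACGT symbols.
    """
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*" and to_stop:
            break
        protein.append(residue)
    return "".join(protein)


# First line of the gene sequence above, used as a short example.
first_line = join_blocks(
    "ATGGGTGCAC CCACCGACAT CGAGCCCTTC CGGCACCAAT ACCTGGATTA CGACACCCTC"
)
print(translate(first_line))  # MGAPTDIEPFRHQYLDYDTL
```

The printed residues match the first 20 residues of the protein sequence. Passing the full joined gene sequence to `gc_fraction` and `translate` would allow the same comparison against the GC content and the complete protein listed in this record.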