Gene Mlg_0668 details

Gene Information

Locus tag: Mlg_0668
Symbol:
ID: 4268463
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 733110
End bp: 735269
Gene Length: 2160 bp
Protein Length: 719 aa
Translation table: 11
GC content: 70%
IMG OID: 638125417
Product: putative PAS/PAC sensor protein
Protein accession: YP_741512
Protein GI: 114319829
COG category: [L] Replication, recombination and repair
COG ID: [COG2176] DNA polymerase III, alpha subunit (gram-positive type)
TIGRFAM ID: [TIGR00229] PAS domain S-box
    [TIGR00573] exonuclease, DNA polymerase III, epsilon subunit family
    [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type
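
The Start bp, End bp, Gene Length and Protein Length fields can be cross-checked with simple arithmetic. The sketch below is illustrative and not part of the record: it assumes the coordinates are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 733110
END_BP = 735269
GENE_LENGTH_BP = 2160
PROTEIN_LENGTH_AA = 719


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one for the terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))            # 2160
    print(expected_protein_length(GENE_LENGTH_BP))  # 719
```

Under these assumptions both derived values agree with the listed Gene Length and Protein Length.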


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 45
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTCCGCAC CACACCGCCC GCCGCGTGCA CTGGAGCCCC GCACCCGGTT TTGGCTCCTG 
TTGCTGGTGC CCGTTGGGGT GCTGGTCCTT ACGCTGCTGG CGCTGGGCCG CTACCTGGTG
CCGCAACTGC CCGACGAGCT CCAGCAGGGC CGTGCCATCC TGGCATTGGG CGGCTTTGGG
CTCGGCTTCG GCGGCCTGCT GGCCCTGCTC TGGCTGGTGC TGGACCAGCG CCTGGCCACC
GCCGCGGCGG CGGTCAGCCG GGGCGCCGCC ATCATGCGCC ACGGCAACCC CAGCCACCGG
ATCGAGCTGC CCGGCCCGCA CCTTCTGGGC GAACTCCCCG AGCGGGTCGA GACCCTGGGA
CGGGCACTTT ACGAGTGCCG GCAGGAGGTG GCCAAGGCCA TCACCGGCGA CCCGCGCGGC
CTGGAGGCCC AGAAGGCGCG GCTGGAGATC GTCCTGCGCG GGCTGCGCCA GGGGGTGGTG
GTCTGCGACG AGCAGGGTCG CATCATGCTC TACAATCCGG CCGCCCGCGC CATCCTCGGC
GACCACCCGG CGCTGGGCCT GTGCCGGTCG CTTTACGGTA TTCTCGCCCG GGCGCCGCTG
GAGCACAGCC TCGAGCTGCT GCGCCACCAG CGCGACGAGG GCGAAGGCAA GGCCGAGCCC
GACGAGGCGG TCACCGAGGC CAGCACGGAG TTCGTCTGCG CCACCCAGGA TCAGGGCAGC
CTGCTCCACT GCCAACTCGC TCTTCTGCCC GCCTCCAGCC TGTTACAGTC CGCCTTTGTG
CTCACCTTCG ACGACATCAC CCGCGAGCTC CGGGGCATGG CCGAACGGGA GCAACGACTA
CGCCAGACCA TGGAGGCCTT GCGGGCACCG CTGGCCAGCC TTCGGGCCGC CACCGACAGC
CTGGACCGGC GGGCGGACAT GACCCCGGAA CAGCGCCAGG CCTTCGAGGC CTTAGTGGCC
CGGGAGACCA GCGCCCTCGG GACACGCTTC GAGCAACTGG CCGCCGAGGC GCAGCGCTTC
GTCTCCACCC CCTGGGTGAT GGCCGATCTC TACACCGCTG ACCTGCTCGG CAGTGTATTG
CGCCGCAGCC AGCGCGCCCT GCCCCGGGTG GACGAGGTCG GGCTGCCGCT GTGGGTCCAC
GCGGAGGGCC ATGCGATTGG GCTGGTACTG GAGGATCTGC TCGCCCGCCT GGCCGGGGAG
CACGGGGTGG ACCGGGTGAC CGTGGAGGCC CTGATGGGCG ATCAACGCGT CTATCTCGAC
CTCACCTGGC GCGGCAGGCC GGTCGCCAGC GAGACCCTGG ATCGATGGCT GGACGCCCCC
CTCCCGGACC TGGGGGGCGA GCTCTCCGCG CGGACCCTCC TGGAGCGGCA CCACACGGTC
GCCTGGAGCC AGCACAGCGA ACGCGACCCG GGGTGCGCCT TTCTGCGCAT CCCCCTACCG
GCCTCGGAGC GCCAATGGCA ATCGCCCAGC CACGCACCCG CCGTGCAGCA CGAGTTTTAC
GACTTTTCCG TGGCGCAGCA ACCGCCGCAG TTGGGCGAGT GGGCCGAGCG AGCGCTGGAT
CAGCTCAGTT GCGTGGTCTT CGATACCGAG ACCACCGGCC TGTCGCCGGC GCGCGGCGAC
GAGATCATCG CCATCGGTGC CGTGCGTCTG GTCAACGGAC GGGTGCTGCA GGACGAGTAT
TTCGAGCAAC TGGTGAAACC GTGCCGGGCG ATACCGGACA GCGCCACCCG CTTCCACGGC
ATCAGCAACG AGGATGTGGC ACGGGCGCCG GAGATCGGAC CGGTTCTGCG GGCCTTCTCG
CGCTTCATCG GCGAGGAGAG CGTACTGGTG GCCCACAATG CCGCCTTCGA TATGGGGTTT
CTGCGCCGGG CACAGGCCCG GGCCGGGCTG GCGTTCCCCC AGCCGGTGCT CGATACGCTA
CTATTGTCGG TGTATTTGCA CGACCATTCC CCCGACCACA CGCTGGAGGG CGTGGCAGAG
AGGCTGGGGG TCGAAGTCCA GAAGCGCCAC AGCGCACTGG CCGACGCGCG GGTCACCGCC
GATGTCTTCG CCCGAATGAT TCCCCTTCTG CGTGAACGGG GTGTGGTCAC TTTGGGCCAG
GCAATCCAGG CCTCAGAGCA GGTGGTCACG GTCCGCAGAG AGCAGGCCCG ATTCCGTTAG
 
Protein sequence
MSAPHRPPRA LEPRTRFWLL LLVPVGVLVL TLLALGRYLV PQLPDELQQG RAILALGGFG 
LGFGGLLALL WLVLDQRLAT AAAAVSRGAA IMRHGNPSHR IELPGPHLLG ELPERVETLG
RALYECRQEV AKAITGDPRG LEAQKARLEI VLRGLRQGVV VCDEQGRIML YNPAARAILG
DHPALGLCRS LYGILARAPL EHSLELLRHQ RDEGEGKAEP DEAVTEASTE FVCATQDQGS
LLHCQLALLP ASSLLQSAFV LTFDDITREL RGMAEREQRL RQTMEALRAP LASLRAATDS
LDRRADMTPE QRQAFEALVA RETSALGTRF EQLAAEAQRF VSTPWVMADL YTADLLGSVL
RRSQRALPRV DEVGLPLWVH AEGHAIGLVL EDLLARLAGE HGVDRVTVEA LMGDQRVYLD
LTWRGRPVAS ETLDRWLDAP LPDLGGELSA RTLLERHHTV AWSQHSERDP GCAFLRIPLP
ASERQWQSPS HAPAVQHEFY DFSVAQQPPQ LGEWAERALD QLSCVVFDTE TTGLSPARGD
EIIAIGAVRL VNGRVLQDEY FEQLVKPCRA IPDSATRFHG ISNEDVARAP EIGPVLRAFS
RFIGEESVLV AHNAAFDMGF LRRAQARAGL AFPQPVLDTL LLSVYLHDHS PDHTLEGVAE
RLGVEVQKRH SALADARVTA DVFARMIPLL RERGVVTLGQ AIQASEQVVT VRREQARFR
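
The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any computation, such as re-deriving the GC content field. The following is a minimal sketch, not the method used to produce this record; `clean_sequence` and `gc_fraction` are hypothetical helper names, and the demo input is only the first two blocks of the gene sequence.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


if __name__ == "__main__":
    # First two ten-base blocks of the gene sequence in this record.
    # Paste the full gene sequence here to compare with the GC content field.
    first_blocks = "GTGTCCGCAC CACACCGCCC"
    seq = clean_sequence(first_blocks)
    print(len(seq), gc_fraction(seq))
```

The gene sequence begins with GTG, an alternative start codon under translation table 11 that is read as methionine at initiation, which is consistent with the protein sequence starting with M.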