Gene Mlg_2006 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_2006 
Symbol 
ID4270480 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp2275718 
End bp2277907 
Gene Length2190 bp 
Protein Length729 aa 
Translation table11 
GC content69% 
IMG OID638126762 
Productputative PAS/PAC sensor protein 
Protein accessionYP_742838 
Protein GI114321155 
COG category[L] Replication, recombination and repair 
COG ID[COG2176] DNA polymerase III, alpha subunit (gram-positive type) 
TIGRFAM ID[TIGR00229] PAS domain S-box
[TIGR00573] exonuclease, DNA polymerase III, epsilon subunit family 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value0.475164 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGTCT CTGATCATAG TGAACACTGG GTGGAGCGGA TCAGCCGGGG TCTGCTCGGC 
CGGCACGCCA CGCCTACCGG CCACACCAGC CGCCGACTGC TGTTCTGGGC GCCGGCAATG
GGGCTGGCTG TTCTGCTGAC AGGGCTGTTG CTCGGCCTCG CCTATCTCTC CCTCTCCGCG
CTGCCCGCCG GCGCCGACCC AACACCGCTG GTGGTCGCCT TCGGCGCTGC GGGGCTGCTC
CTGCTGGCGG CGATCACGGC CATCTGGCTG TTGCTGGATG CCACCGTGCT GCGCTCCCTG
TCCGCGCTGG CCCGCGATGC GGCCATACTG GCCTACACCA ACCCCGATCA CCGGCTTCAG
CTGCCGTCGG TGCACCTGCT CGGCGAGTTG CCCGGGACGC TGCGCAACCT CGCCCGCCAG
TTGCAGGTCC GGCGCCGGGA GGTAGAGGCC GCCGCGGCCA CGGCCGCCGA ACAGGCCGAG
GCGCAGAAGG CGCGGCTGGA GGTGGTGCTG CGCGCCATCC GGCAGGGGGT CGTGGTCTGT
GATGCCGACG GACGGATCCT GCTCTACAAC CCCGCCGCCG GTGAGCTGCT CCACAGCGAC
GCCCTGGGGC TTGGCCGCTC CATCCATGAA CTGTTGAACC CAGCCGCCGT TGAGCACCCC
CGACAACTGC TGCAACACCG ACTGCGCCAG GATTCGGACG ACCCGGTCAT GGACGAGGGG
GTGGAGTTCG TCTGTACCAC GGTCGATGAC GGCGCCCTGC TCCGCTGCCA GATGAGTCTG
CTGCCGACCC ACGGCCCGCT GCGCTCGGCC TTCGTCATCA CCCTGGAGGA CATCACCCGC
CGGATCGAAG GCGTGGCCCG CCGTGACCAA GCCCTGCGCA GCGCCGTGGA GGCCCTGCGC
TCGCCCCTGG CCGCGGTGTC GGTGGCGGCG GAGCTGCTCA ACGAGTACCC GCAGATCGAC
GATGCCCGCC GACGCCGGTT CATCGACATC CTGGCCAAAG AGAGCCACGT GCTGGTCCAG
CGCTTCGAAC AAATCGCGGA GGCCACTCAG GAGAACGTCT CCGCCCCCTG GACCATGGCC
GATATCAGCA GCGACGATTT GGTGGACAGC GTGTTGCTGC GCCACCGCGA CACCCTGCCC
CGCGTGGCGC TGGCTGGCCT GCCCCTATGG CTCCATGCGG AGAGCCACGC CATCGGACTC
GTGCTCACCC ATCTGTTGCA CCGTCTGGGA CGGGACCACG GCGTCCTGGC CGTGCGCATC
GAGGCGCTGA TGGGCAACCG CCGGGTGTAC CTGGACATCT CCTGGGCGGG CGAGCCGGTG
CCGGGCCCCA CCCTAGAGCA GTGGCTTGAG ACCCCGTTGC CGGAGGCCAT AGGGGAATTG
AACGCCCGCG CCGTGCTCGA GCGGCACAAC AGCCTGGCCT GGAGTCAACG GGACCGGCGA
ACGCCGGGGT GGGCGTGCCT GCGCATCCCG TTGCCTGCCT CCAGCCGGCA ATGGAACCCG
CCCGGGGAGA GCCTCCCCCC GCGCCCGGAA TTCTATGACT TCTCTCTCAT CGACCAGGCC
GCGGACCAGG GCGCTCTCCT CGACCGGCCC CTGGACGCGT TGAACTACGT GGTGTTCGAC
ACCGAGACCA CCGGCCTGTC TCCAGCGGAG GGTGACGAGA TCGTCTCCAT CGCCGGGGTG
CGAATGGTCA ATGGCCGCCT CCTGGACGGC GAGCGCTTCG AGCAACTGGT CAACCCCGGC
CGGACCATCC CCCGCAGCTC GATCCTGTTC CACGGCATTC ACGATACGAC GGTCGCGGAT
AAGCCCCGCA TCGAAACGGT ACTGCCGCGA TTCCACACCT TCGTCGGCGA CTCAGTGCTG
GTCGCCCACA ACGCCGCCTT CGACATGAAG TTCATCCGAC TGAAGGAGCG GCGCTGCGGC
GTGCGCTTCG ACAACCCGGT TCTGGACACG CTGCTGCTCT CGGTCTTTCT CCACGACCAC
ACCGCGGACC ACACCCTCGA GGCCATCGCC GCTCGGCTTG GGGTGGAGGT GACCGCGCAG
CACACCGCCT GGGGCGATGC CCTGGTCACG GCACGGGTCT TCGCCTGCCT GTTACCGCTA
CTGCGCGAAC GGGGGGTCCA CACGCTAAGG GACGCGGTGG CGGCATCAGA GCGGATGGTG
GAGGTACGCC GGCAACAGGC GCAGTTCTGA
 
Protein sequence
MSVSDHSEHW VERISRGLLG RHATPTGHTS RRLLFWAPAM GLAVLLTGLL LGLAYLSLSA 
LPAGADPTPL VVAFGAAGLL LLAAITAIWL LLDATVLRSL SALARDAAIL AYTNPDHRLQ
LPSVHLLGEL PGTLRNLARQ LQVRRREVEA AAATAAEQAE AQKARLEVVL RAIRQGVVVC
DADGRILLYN PAAGELLHSD ALGLGRSIHE LLNPAAVEHP RQLLQHRLRQ DSDDPVMDEG
VEFVCTTVDD GALLRCQMSL LPTHGPLRSA FVITLEDITR RIEGVARRDQ ALRSAVEALR
SPLAAVSVAA ELLNEYPQID DARRRRFIDI LAKESHVLVQ RFEQIAEATQ ENVSAPWTMA
DISSDDLVDS VLLRHRDTLP RVALAGLPLW LHAESHAIGL VLTHLLHRLG RDHGVLAVRI
EALMGNRRVY LDISWAGEPV PGPTLEQWLE TPLPEAIGEL NARAVLERHN SLAWSQRDRR
TPGWACLRIP LPASSRQWNP PGESLPPRPE FYDFSLIDQA ADQGALLDRP LDALNYVVFD
TETTGLSPAE GDEIVSIAGV RMVNGRLLDG ERFEQLVNPG RTIPRSSILF HGIHDTTVAD
KPRIETVLPR FHTFVGDSVL VAHNAAFDMK FIRLKERRCG VRFDNPVLDT LLLSVFLHDH
TADHTLEAIA ARLGVEVTAQ HTAWGDALVT ARVFACLLPL LRERGVHTLR DAVAASERMV
EVRRQQAQF