Gene Mlg_2053 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_2053 
Symbol 
ID4270187 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp2327416 
End bp2329560 
Gene Length2145 bp 
Protein Length714 aa 
Translation table11 
GC content67% 
IMG OID638126809 
Productputative PAS/PAC sensor protein 
Protein accessionYP_742885 
Protein GI114321202 
COG category[L] Replication, recombination and repair 
COG ID[COG2176] DNA polymerase III, alpha subunit (gram-positive type) 
TIGRFAM ID[TIGR00229] PAS domain S-box
[TIGR00573] exonuclease, DNA polymerase III, epsilon subunit family
[TIGR01406] DNA polymerase III, epsilon subunit, Proteobacterial 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value0.115114 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGCAGC GGCCCTTCGT CACCTGGGTC CTGACGGGCG CAGTTGCCGT CGCCGCCGCC 
ATAGTGGTGC TCGCGGTCTA CAGCGTCACG CCCAGCGAGC CGGGGGGCAC CGATAACACC
GCGCTGGTGG CCAGCCTCGG GGTACTTGGG GTGGTGGCCG TCGGGCTGGT CAGCTGGCTC
TTGCTGGAGC GCCTGTTGAT GCGGCCCTCC CGGAAGCTCA GCCGCGGGGT CCGGGCGCTG
CTGGAATCCC GCCAGGCGAA CCAGGAGATC ATCCTCCCCC GGCGCCATGC CCTCGGGGAC
CTGCCGACCA CGGTGGAGTC ACTGGCCGAT GCGTTGCGCA AGTCGCGGCG AGAGACCCGC
AAGGCCATGC AATCGGCCAC CGCCGAACTC TCCGAGCAGA AGACCTGGCT GGAGACCATC
CTGCAGGGAC TCAGCGAAGG GGTGCTGGTC TGCAACCGCC AGCACCAGAT CATGCTCTAC
AACCGGGCGG CCGGGACTAT CCTCGGACAC CCGGAGGCCA TCGGTCTCGG CCGCCCCCTG
TTCAACGTGC TCTCCAGTCC GCCGGTGCAG CACACCCTGG AGCGCCTGGA GCGCCGCCAC
AAGGGCGATG TGGACCTGCC GACGGAACTC TCCGCCCCCT TCGTCTGCAC CAGCGCCGAT
GCCCAGCGCA TGTTCCATGG GCGCATGGCC TTGATTCAGA ACAGCCAGGG CCAGATTACC
GGTTATCTCA TCACCCTGGT GGACATCTCC AGCGATCTCA CCCGACTGGC CCAGGGTGAC
GCCGTGCGCC GGGCGCTCAC CCGCGACCTG CGTGGCGTGG TGGGCAACCT GCGCGCCGCC
GCGGAGACCG TGGCCAGCTA TCCGGACATG AAGCCGGAGG AGCGCCAGTC GTTCGATGCC
GTCATTCGCA GCGAGAGTGA GAAGCTGAGC GAGCAGATCG ACGAGCTGGC CCACCAGATC
CGGGGCTACA ACCTGGGGCG GTGGCCCATG GCCGACGTCT TCGTGGGGGA TCTGGTCAAC
TGCCTGCAGC AGCGGCTGAT GGACCTCCCG GACGTCCGGT TGACCCTGGT GGGGCTGCCG
CTCTGGGTCC ATGGCGACAG CCTTTCACTG ATGCTGGCGC TGGATTGCCT GGTGCGACAG
GTGCATGACC ATACCGGCGC GACGGCCTTC GATGTGGAGG CGTTGCTGGG CGATACCCGG
GTGTACGTGG ACATCAGCTG GCAGGGCGAG CCGATCACTA CCGGCGAGCT GAACCGCTGG
ATGGCGATCC CCTGCGGCGG CGATGAGGTG GGCGGCCAGC ACCTGGGGGA TATCCTGGAG
CGGCATGGCT GTGAGCCCTG GAGCCAGACG GGCAGGCGCG AGGGGCAGGC CTTGTTGCGC
TTGCCGCTGA TGATGTCCCG CCGGCCCCAG TTCATGGAGG AGGAGGAGCG GCTGCCGGCA
CGGCCCGAGT TCTACGACTT CGGGCTGATG CAGGAGTATG CCGGGGACGA GGCCCTGGCG
GCCCGGCGGC TGGCCGACCT GAGTTTCGTG GTGTTCGATT GCGAGATGAC CGGGCTGAAC
CCCGAGGGGG GCGATGAGAT CATCTCCATC GCCGGGGTCC GGGTGGTGAA CGGCCGGGTA
CTGACCGGTG AGACCTTTGA CCGAATCATC AATCCGGGAC GCCCCATCCC GCCTGGCTCC
GTGCGCTTCC ACGGCATCAC CGACGACGAT GTGCAGGACA AGCCGCCCAT CGAGGTGGTG
CTGCCCCAGT TTAAATCCTT CGTGGGGGAC GCCGTCCTGG TGGCGCACAA TGCCGCCTTC
GATATGAAGT TCATCAGCAT GAAGGAGCGC GAGGCCGGGG TGCGCTTCGA TAATCCGGTG
CTCGACACCC TGCTGCTCTC GGCCCTGCTG GACGGTGATG AGGAGGACCA CTCGCTGGAC
GCGCTGTGCG ACCGCTACGG CATTGCCATC ACCGGCCGTC ACACCGCTCT GGGCGATACC
CTGGCCACCG CCGAGCTGCT GGTTCGTATC ATTGAGCGGC TGGAGGCGCA GGGCTACCAG
ACGTTCGGCG AGGTGATGAA GGCCTCGCGG ATGGCCGCGG AACTCCGCCA CCGCTCCGCG
GTCTTCAGTG CACAGGGCGA GGGGATGGAG TCCTCCCGCA CCTGA
 
Protein sequence
MRQRPFVTWV LTGAVAVAAA IVVLAVYSVT PSEPGGTDNT ALVASLGVLG VVAVGLVSWL 
LLERLLMRPS RKLSRGVRAL LESRQANQEI ILPRRHALGD LPTTVESLAD ALRKSRRETR
KAMQSATAEL SEQKTWLETI LQGLSEGVLV CNRQHQIMLY NRAAGTILGH PEAIGLGRPL
FNVLSSPPVQ HTLERLERRH KGDVDLPTEL SAPFVCTSAD AQRMFHGRMA LIQNSQGQIT
GYLITLVDIS SDLTRLAQGD AVRRALTRDL RGVVGNLRAA AETVASYPDM KPEERQSFDA
VIRSESEKLS EQIDELAHQI RGYNLGRWPM ADVFVGDLVN CLQQRLMDLP DVRLTLVGLP
LWVHGDSLSL MLALDCLVRQ VHDHTGATAF DVEALLGDTR VYVDISWQGE PITTGELNRW
MAIPCGGDEV GGQHLGDILE RHGCEPWSQT GRREGQALLR LPLMMSRRPQ FMEEEERLPA
RPEFYDFGLM QEYAGDEALA ARRLADLSFV VFDCEMTGLN PEGGDEIISI AGVRVVNGRV
LTGETFDRII NPGRPIPPGS VRFHGITDDD VQDKPPIEVV LPQFKSFVGD AVLVAHNAAF
DMKFISMKER EAGVRFDNPV LDTLLLSALL DGDEEDHSLD ALCDRYGIAI TGRHTALGDT
LATAELLVRI IERLEAQGYQ TFGEVMKASR MAAELRHRSA VFSAQGEGME SSRT