Gene Mlg_2466 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_2466 
Symbol 
ID4270207 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp2800737 
End bp2802275 
Gene Length1539 bp 
Protein Length512 aa 
Translation table11 
GC content65% 
IMG OID638127224 
Productputative regulatory protein, LysR 
Protein accessionYP_743296 
Protein GI114321613 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.030502 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value0.578084 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCAATT GTCTGTTTGA ACAGGGGGGA AGGCGCCGGG CCTCGGTGCT GGGCGTCAGT 
GCCCTGGCCC TGATGACGGC CTCCGGGGCC GCTCAGGCGC TGGATTTGGA CGCCAGCATC
AGCGGGGCCA TCACCCAGGA TATCTCCGTG AACCTGGACA ACCCGGCGTC CTCCAATCCG
GCGCTGGAGG GCAACCACCG CAACCGGCTC TCCATGGTGC GCACCACTCT CCACCTGGAT
ATCGACGCCG AGACCGACTG GGCCAGCTTC GTCACCAAGT CCCGGGTGGT GCGCGAGGCC
AGCACCCGCT ACATGCGGCG GCTGGAGCGC GCCGGTGCCA ATGGCGGTGA CGAGGACAGC
CTGCGCGAGT ATTACAACGA GACCGAGCTG CGCGAGGCCT ATGTCGATTT CTATCCCACC
CGCAACACCG ACGTGCGCCT GGGCAAGCAG CAGGTCGCCT GGGGCGAGAC CGATTTCTTC
CAGGGGACTG ACTTGGTGCA CGGCTTCGAC TTCCGCTGGC GCAGCTTCCT GGAGCCGGCC
AACGAGGAGC TGCGCAAGCC ATTGATCATG GCCAACATCA CCCAGCACTT CCCGAGCCTG
GATGGCTCCC TGCAGGTGCT GGTGCGCCCG GGACTGGACC GGCGCACCGA TATTGGCAAC
AGCTACGACC TGGAGGGGGG GCGCTGGGCC AATACCCCCA ACAAGGGCGT CGACCTGACC
ACCCTGGTGC CCTACAACCT CGAGCACGAT CAGGGGGATT ACCAGGATGT CACTGGCGGC
CTGCGCTGGG AGGGCTTCGC CGGCGGGGTC GGCTACTCCC TGGCCTACCT ACACACCTTC
TCCGCCGACC CGGTAGTGAG TCCGGCCTGG AACCCCTATC GCAGCGAGAC CACCCGGGGG
CCCTGGGGCG AGACCATCTA CCCCACGGTG GATGTGTTCG GGGCCACCGC CAATGGTTAC
GCCCGCTGGG GTGATTTCGT CTGGAGCACC GAGATCGCCT ACATCAAGGA CCAGCCTTAT
AACTTCGGAA CCGTGGCAAA CGCCTCGGCG GACAACGCGG CGGCGGCCGC GTTGGGCCTG
GCCGGGTTTG AGGGCATCGA GACCAAGGAC GTGGTCCGCA GCATGGTGCG GATGGACAAG
GATCTGCGCG GTCTGGGGCG TCTGCTGAAT GCGGACCGGC CTGCCTTCTT CTCCGTGCAG
GTGTTTGACA CCTGGATCAC CAGCTACGAC CGCGACGACG AACTCGTGGA GCTGGTCGGC
TTTGGCCAGC GCAGTCGCGA GAACTCGACC CTGGTGACCG GCATTCTGGG TCTGAGCTAC
CGCAACGGCC GAGTCAACCC GGAGCTGGTG GTCGGCTTCG ACACCAGTTA TGGCGGCGGC
TTCGCCGTCC CCAGTGTCTC GTTCGAGTAC GGCGATAGTC TGCGCTTCAA GCTGGAGGCG
GACCTCTTCT GGGGCGGAGA CGAGGTCGGC AGCCGCTCCC TGTTCGGCTA TTTCGAAGAC
AGTAACCAGC TCTTTGCACG GGCCAAGTAC CAGTTCTGA
 
Protein sequence
MTNCLFEQGG RRRASVLGVS ALALMTASGA AQALDLDASI SGAITQDISV NLDNPASSNP 
ALEGNHRNRL SMVRTTLHLD IDAETDWASF VTKSRVVREA STRYMRRLER AGANGGDEDS
LREYYNETEL REAYVDFYPT RNTDVRLGKQ QVAWGETDFF QGTDLVHGFD FRWRSFLEPA
NEELRKPLIM ANITQHFPSL DGSLQVLVRP GLDRRTDIGN SYDLEGGRWA NTPNKGVDLT
TLVPYNLEHD QGDYQDVTGG LRWEGFAGGV GYSLAYLHTF SADPVVSPAW NPYRSETTRG
PWGETIYPTV DVFGATANGY ARWGDFVWST EIAYIKDQPY NFGTVANASA DNAAAAALGL
AGFEGIETKD VVRSMVRMDK DLRGLGRLLN ADRPAFFSVQ VFDTWITSYD RDDELVELVG
FGQRSRENST LVTGILGLSY RNGRVNPELV VGFDTSYGGG FAVPSVSFEY GDSLRFKLEA
DLFWGGDEVG SRSLFGYFED SNQLFARAKY QF