Gene Mlg_1143 details

Gene Information

Locus tag: Mlg_1143
Symbol:
ID: 4269638
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 1337415
End bp: 1339508
Gene Length: 2094 bp
Protein Length: 697 aa
Translation table: 11
GC content: 72%
IMG OID: 638125892
Product: hypothetical protein
Protein accession: YP_741982
Protein GI: 114320299
COG category:
COG ID:
TIGRFAM ID:
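
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon; the record itself does not state either convention, and the variable names are illustrative only.

```python
# Values copied from the record; variable names are hypothetical.
start_bp = 1337415
end_bp = 1339508

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: one codon (3 bp) per residue, with the stop codon not counted.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 2094 697
```

Both values match the "Gene Length" and "Protein Length" fields of the record.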


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 43
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGACT CCATCGCCCC CCAGGTCTGG CCCGCGGGAC CGCGCGGCCG CGACGATGAC 
CTCTTCCACG AAGACCCGCG CCCCCTGTGC GCAACGTTCC TGTGGGCGGA CATCGTCTAT
CTCACCGGCT CCGGCGAGTT CTGGCTGCTC AACGCCACAG CCGCCGCCGC GATGCACCAC
GCGGCCGACA AGCTGGCCGA CATCGCCGCC GTGGACGACC GCGACGAGCG CAACCGGCGG
CTGTCCGAGG AGGCGGGCGT GCTGGACAGC TTCCTGCCAG CGCATCCGGT CAGTTTCCTG
GGGGAAGCGG ACCGCCAACG GTTCGCGCAG ACCCTCCAGC AGCTCGCCGC CCTTCAGGAC
GAGGCCCCGG ACACGCTGCT GCAACGGGTG GTGGACGGCG TGCCTTTCCA GGGCACCACC
AGCGTGAGCA CCTCGCAATC CTGGTCCGGC GATCATTCAA TGCCTTCGGC CGTACCGACC
CAATGCCCGG AACCGGCACG CATCCACGAG AGCAACGACC ATCTCGACGC ACTTCAGGCC
CTTTATCAAC GCGGCCTGGA CAAGGCCGAG AAGGCCGGCT ATGTCGTCGA TAGCGCCCTG
GTCCACGGCG ACAGCGAGGC GCGCATCCGC GAGGCGCTGC AGCGCTATCA CCGGCGGCGG
GAGCTTGCTT TCCAGGGTGC TCGGTCCCAC CTGGAACAGG GTCAGGGCCT GCCACCGACG
CGGCCGCTCC ATAAGATCCT GGAGCAATAC CGCCGGCATG TGGCGCTCTG CGACCAGGAC
CCGGTACCAG AGGCCGTCGA ACGCTGCGAG ATCGCCTCCG TGATCGAGCA CTACATCCCG
CAACTGGAGC AGGACTACCG CCACTACATC GACAGTCTCA TCGAGCTGGC CGGGCTGGGC
GTGGCCACCC CCGAGCTGGC GCTGGCTGAG GACCCGGACG CCGGCTTCGC CGACGGCGTG
GACTACGTGG CCCGCTACTT CGCGACCCTG GACGAACTCG ACGCCTTGCG CGAGGACGTG
GACACGCGGC TCAGGGAGTG GGAACAGGGC ACGGGCCGGG CCACCCCGCT GCCCATCTTC
CTGTTCACCG ACGAGCAGGC GCGGTTCGAC CGGCTGCGCG AGCGCATGGA CCGCCTCTAC
CGCACGGCCC GGCGCCGGGT GGACCGCACG CGCCCGAGGC GCGTCCTGCA CTGGGACCTG
GGCCCCGACG ACATCCGCGA CCCCGAACCC TACCGCCCGC CCCCGATCCA CCGGCTGGTG
CGCGCCGACT TCCCCCTGCG CGAGTTCAGC GGCCCCGGTC GGCAGCGCAC CCTCGATCAC
CTGAGCCTGC ACCAGCTGGG CGAGACCCGC CCCCACTACG CCCGCCAGCG CGATGCCGCC
ATCGCGCACG ACAGCCGCAC GGTCACCGAG CCCCGCTCCC TGCCCGACAC GGCACTGACC
GGGTGGCTCA CCCGACGCGG CTGCCGGCGG CTCGACTGGA ACCCCGACTG GCACAGCGAG
CCGCTCGGGC TGTTCGAGCC GGAACGCTTC TTCCACGACC TCGACCACCA GGGCCTGGTC
ATCGACCGCC TTGCCGACGA CAGCGCCCGG GAGGAGTGGG GCCGGCGCCT GCGCCGGATC
CTGTTCGCCG ACCCGCTCAA CCACCCCATG CGCCTGTTCG ATGCCAGTGG CCCGGCGCAG
CTCCTGCGCC TGCTGGCCGG CGCATACGCC GAGCCCGACC GGCGCGACCG GGCGCTGTCC
GGGGAGGCAC CCCTGTGGCT GCGCCGGCCC GGACCCGTGG CGCAGGCCGA ACCGGAGACA
GCCGCTTCCG GAACCGGTGC CCGCATCGGC CTGACCGCCC GCTACCAGGA CACCGCCGGC
ATCGACACCG GGGGCAACGC CGGCGGCCGG ACGGTCAGTG TGGCCCCCAG CCTCGCTCTC
GACGAGCTGG GCATCACCGC CACCGTGGCC CCGCGGGCAG GACGGCTCGC GGGTCGTGTT
GCGATTGCGC GGCCAGCACG CCTTCGACTT CGGCCGTGGC GAGATCGCCC TCGCGCCCAT
CCAGCTGCCC GATCCCGCCA AGGCGGAGCC GGTCATCGTC CCCTTCGGCC TTGA
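
The "GC content" field can in principle be recomputed from the gene sequence. The sketch below assumes GC content means the fraction of G and C bases over the whole sequence; the record does not say how its 72% figure was derived. The function name `gc_fraction` is hypothetical, and the spaces and line breaks used in the listing above are stripped before counting.

```python
def gc_fraction(sequence: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)

# First 10-base block of the gene sequence, used only as a small example.
print(gc_fraction("ATGACCGACT"))  # 0.5
```

Passing the full sequence, pasted as a single string, gives the value for the whole gene.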
 
Protein sequence
MTDSIAPQVW PAGPRGRDDD LFHEDPRPLC ATFLWADIVY LTGSGEFWLL NATAAAAMHH 
AADKLADIAA VDDRDERNRR LSEEAGVLDS FLPAHPVSFL GEADRQRFAQ TLQQLAALQD
EAPDTLLQRV VDGVPFQGTT SVSTSQSWSG DHSMPSAVPT QCPEPARIHE SNDHLDALQA
LYQRGLDKAE KAGYVVDSAL VHGDSEARIR EALQRYHRRR ELAFQGARSH LEQGQGLPPT
RPLHKILEQY RRHVALCDQD PVPEAVERCE IASVIEHYIP QLEQDYRHYI DSLIELAGLG
VATPELALAE DPDAGFADGV DYVARYFATL DELDALREDV DTRLREWEQG TGRATPLPIF
LFTDEQARFD RLRERMDRLY RTARRRVDRT RPRRVLHWDL GPDDIRDPEP YRPPPIHRLV
RADFPLREFS GPGRQRTLDH LSLHQLGETR PHYARQRDAA IAHDSRTVTE PRSLPDTALT
GWLTRRGCRR LDWNPDWHSE PLGLFEPERF FHDLDHQGLV IDRLADDSAR EEWGRRLRRI
LFADPLNHPM RLFDASGPAQ LLRLLAGAYA EPDRRDRALS GEAPLWLRRP GPVAQAEPET
AASGTGARIG LTARYQDTAG IDTGGNAGGR TVSVAPSLAL DELGITATVA PRAGRLAGRV
AIARPARLRL RPWRDRPRAH PAARSRQGGA GHRPLRP
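
The protein sequence can be reproduced from the gene sequence with the codon assignments of translation table 11. The following is a minimal sketch, not the tool that generated this record: `translate` is a hypothetical helper, the codon table is built by hand from the NCBI-style amino acid string, and the alternative start codons that table 11 allows are not handled (this gene starts with ATG, so that does not matter here).

```python
from itertools import product

# Codons in NCBI order (first base varies slowest, bases ordered T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate DNA in frame 1 and stop at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spacing used in the listing
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)

# First row of the gene sequence listed above.
first_row = "ATGACCGACT CCATCGCCCC CCAGGTCTGG CCCGCGGGAC CGCGCGGCCG CGACGATGAC"
print(translate(first_row))  # MTDSIAPQVWPAGPRGRDDD
```

The output matches the first 20 residues of the protein sequence. Because every full row of the gene listing holds 60 bases, a multiple of three, each row can also be translated on its own.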