Gene Mlg_1140 details


Gene Information

Locus tag: Mlg_1140
Symbol:
ID: 4269635
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 1333535
End bp: 1335151
Gene Length: 1617 bp
Protein Length: 538 aa
Translation table: 11
GC content: 70%
IMG OID: 638125889
Product: hypothetical protein
Protein accession: YP_741979
Protein GI: 114320296
COG category:
COG ID:
TIGRFAM ID:
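
The coordinate and length fields in this record are related by simple arithmetic. The following minimal sketch shows that relation, assuming the Start bp and End bp values are 1-based and inclusive (an assumption that is consistent with the listed gene length, not something the record states). The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1333535
end_bp = 1335151

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
gene_length_bp = end_bp - start_bp + 1

# One codon (3 bp) per residue; the final codon is the stop codon
# and does not contribute an amino acid.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, "bp")    # 1617 bp
print(protein_length_aa, "aa")  # 538 aa
```

Both results agree with the Gene Length and Protein Length fields listed for this gene.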


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.208936
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 40
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTCCTGT TTTGGCAGGT CACCGCCCGG ACCGGTTGGC TCCCTTGGAA TAGTTGCCGC 
CTGCCCGAAT CCCTGGCGCA GGCCGACCCG GAGCACGATG CCTCCGGAAA CGGCGTCCGC
ATCGGGATCA CCGCCCGCCA TGACGACGCC GTCGGCACCG GTCCCGGCCA ATCCGGGCAG
AGTTTCGCGG TCACCCCCAG CCTGGCCCTG GACGGTCACG TCATCGTTCC CACCGTGGAG
CCCGGGGCCA ACGGCGCGAC GACGCCGCTG CGAACGCAGG GCCAGCACGC CTTCGACTTC
GGCCGTGGCG AGATCGCCCT CGCGCCCATC CAGCTGCCCG ATCCCGCCAA GGCGGAGCCG
GTCATCGTCC CCTTCGGCCT TGATGCCAAT CACCCCGATG CCCGCTCCAT CGGCCGCTAC
TGCCTGCACA TCGAGCCCGT GCTCCACGGC CATGCCGCCG TCTCCGTGGC GCTGGGCGGC
GGCGTCGGGC TCGACACCGC CGGGGGCCGT CTCGCCGTCA ACGGCCTCGC CCCCGTGGAG
CGCGACGGCG TCGATGCCCG ACTGGAGGCC TTCGCCGGCA CCGGACTGGC CGGCCACAAC
CACTGCCGTC TGCTCTGGCA GCCGCCGGCG AACCTGCTCG CGCGCCTGCC GCGCTACCAG
GCCATGGCCG AGATCGACCG CGCGGGCTAC GCCCGGGACG AGGCCCGCCA GTGGAAAACC
CTGACCCGTG CCGAGATCAA CCCCGAAGTA CGCGTCGGGG TCGGCGGCGA GGCCGCCTTC
CGGCTCGGCC TGCACAACGG CCGCTTCGTG CTGCACGCCT CCCTGCGCCT GGTGCTCGGC
GTCGGCGGCG GGGGCAGCGT GCGCCTGGCC CTCGACACCC GCCACCTCGA CCTCTGGCTC
GCCATGATGC ACCAGGCGCT GGTGGAGGTC GGCTACGAGC GCGTCGACTG GATCGACGAA
GACGCCTTCG AGGAGATGAG CCGCCTGGCC TACCTCGCCG CCATCACCCT GGTCGAACCC
GCCCTGCTCC TGCTGCGCGG CACCCACCGC CTGCGCCAGC TGATCGAATG GTTCACCCGG
GAGCGGGACA TGGCCAGCCG GATCGCCTAC GAACTCGCCG CCGAGGAACC GCCCAACCTC
CGGTATGACC CGGAGGCCAG TCGGGAACAC CACAAGCGCG TCCAGCAACT GCGCGCCTGG
GTGCGCCAAC TGCCGCCCGA GGCCCTCGGG CCGCTGCTTT ATACCCTGAC CAGTCAGCCG
CAGGCGTTCG AGGTGGAGGA GAACCAATAC AACGTGGAGC AAGCACGAGG ATTCCACCAG
CGGGCGATCC TCAACTGCCT GCAATGGATT GTCTCCGGCG TCCTGGCCGG CGTCTACGGG
CCCCGGCGCG AGTTCTCCGC AGAGCACCCG AACCCGGCGC AAAAGTTGTT TGAAAAGGCC
GTGGTGCGCA TGGCCCGAGA CGGACAGCCT ACCGACGCAT CGAGGGCCGA TGCGTATGCC
GAGAACCGAG GGCGGCTGGA TCAGTTCATA TCAGGGGGCC GCGCTACCCC TGAGCAATCC
GATATGCAAA GGAAATACAG ACAGAATGCC GGCTGGCTTT CCCGCCACAT TCAGTAG
 
Protein sequence
MLLFWQVTAR TGWLPWNSCR LPESLAQADP EHDASGNGVR IGITARHDDA VGTGPGQSGQ 
SFAVTPSLAL DGHVIVPTVE PGANGATTPL RTQGQHAFDF GRGEIALAPI QLPDPAKAEP
VIVPFGLDAN HPDARSIGRY CLHIEPVLHG HAAVSVALGG GVGLDTAGGR LAVNGLAPVE
RDGVDARLEA FAGTGLAGHN HCRLLWQPPA NLLARLPRYQ AMAEIDRAGY ARDEARQWKT
LTRAEINPEV RVGVGGEAAF RLGLHNGRFV LHASLRLVLG VGGGGSVRLA LDTRHLDLWL
AMMHQALVEV GYERVDWIDE DAFEEMSRLA YLAAITLVEP ALLLLRGTHR LRQLIEWFTR
ERDMASRIAY ELAAEEPPNL RYDPEASREH HKRVQQLRAW VRQLPPEALG PLLYTLTSQP
QAFEVEENQY NVEQARGFHQ RAILNCLQWI VSGVLAGVYG PRREFSAEHP NPAQKLFEKA
VVRMARDGQP TDASRADAYA ENRGRLDQFI SGGRATPEQS DMQRKYRQNA GWLSRHIQ
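
The gene and protein sequences above can be checked against each other by translating the nucleotide sequence. The sketch below is one hedged way to do that in plain Python. It builds the standard codon assignments, which translation table 11 shares (the two tables differ in which codons may act as start codons, not in what each codon encodes). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical and not part of any database tooling. Only the first and last lines of the gene sequence are used here to keep the example short.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-character blocks and uppercase."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last lines of the gene sequence, copied from this record.
first_line = "ATGCTCCTGT TTTGGCAGGT CACCGCCCGG ACCGGTTGGC TCCCTTGGAA TAGTTGCCGC"
last_line = "GATATGCAAA GGAAATACAG ACAGAATGCC GGCTGGCTTT CCCGCCACAT TCAGTAG"

print(translate(first_line))  # MLLFWQVTARTGWLPWNSCR
print(translate(last_line))   # DMQRKYRQNAGWLSRHIQ
```

The two translated fragments match the beginning and end of the protein sequence listed above. Because every full line of the gene sequence holds 60 bases, each line starts in frame, which is why the last line can be translated on its own. Passing the complete gene sequence to `gc_fraction` would give a value to compare with the GC content field.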