Gene Mlg_2465 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_2465 
Symbol 
ID4270206 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp2799451 
End bp2800716 
Gene Length1266 bp 
Protein Length421 aa 
Translation table11 
GC content65% 
IMG OID638127223 
Producthypothetical protein 
Protein accessionYP_743295 
Protein GI114321612 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.149567 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value0.267329 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGATTGA CGACTATCGG TTATTCCGCA CTTCTCGCGG GCCTGGTGGG GCTCCCCACC 
GGCGCCTGGG CGGAGATCCC CGAGGGCACG GTGCTCTCAG CGGACAACAT CGATGAACTC
TACGACCAGA CCTTCCAGGG GCACCGGGTG GGTGACCTGC TGACCGAGCG CCTGGAGTGG
CGCATCCGCG AGTCCGGGTT CGAGATGCCC ATGTATCACA GCCAGGCGAT CGAGCTCGAT
CCCAGCTACC TGGAGGCCAC CGAGGGCAAT CGGGAGGCGG TCAGCTTCAA CCCGGACACC
CGCCAGGTGG AGGGCTGGCA GGCGGGGATG CCCTTCCCGG AGATCGACGA GGATGACCCG
CACATTGCCG AGAAGCTGAT TTGGAACTGG TACTACGGGC AGCCCCGGGG CGATGTGATG
AACGTGCCCA ATGTCACCTA CATGATGGTG GACGCCGACA GCGGCGTTGA CCGCATCCAG
AATTGGTGGT TCCTGCGCTA CACCATGAAG GGGCGGCTGG CCGCGGATGA CCCGGTGGCC
GGTGACGGCA GTGAGCTGAG CCGGACCCTG TTCGTGGCCA CCGAGCCGCG GGATGTGCGC
GGCCTGGGCA CCCTGAGCAT CCGCTACGAC TCCGACGCCA TGGAGGATGT CTGGGCCTAC
ATCCCCGCCG TGCGCCGGGT CCGCCGCCTC TCCGGTGGGG CCTGGATGGA CCCGGTGGGC
AGCACCGATC AGCTCCAGGA CGATATCGAG ATCTTCAACG CCCAGCCCTC CTGGTATGAC
GAATACCGCC TGGTGGAGCG GCGTTGGGTG CTGGCGGCGG CCAATGGCCG GGCGGACAAC
GTCAATGGCA ATGCCGACGA CGTTGACGCG CAGTATCCGC TGTTTGATCT CTCCCAGTCG
CCGTACTGGA ACGTCAACTT CGACCGCTAC GAGCCGCGGG AGGTTTGGGT GATCGAGGCG
ATCCCGCCGA GCGAGCACCC CTATAGCAGG AAGGTGGTGT ACATGGATAC CCAATATCCG
CGGCTCCACT ACGGCGAGGC CTATAACCAG GCCGGCGACT TCTGGAAGTT CCTGCAGTTC
AACTCCACCC CGGGGGAGGG TGACGACGGC TTCCGGGATA TCCGGACCAA TGCCGGCGTG
GTCATCGATT TCCTGCGCAA CCGGGCCACG ATCTTCATCC CCGATCACTC CGAGTGGAGC
ACCAACACCC CGGGTTTCAC CGAGGATGAC ATCGGCGTCT CCACGTTGCG TTCGGTGGCC
CGCTGA
 
Protein sequence
MRLTTIGYSA LLAGLVGLPT GAWAEIPEGT VLSADNIDEL YDQTFQGHRV GDLLTERLEW 
RIRESGFEMP MYHSQAIELD PSYLEATEGN REAVSFNPDT RQVEGWQAGM PFPEIDEDDP
HIAEKLIWNW YYGQPRGDVM NVPNVTYMMV DADSGVDRIQ NWWFLRYTMK GRLAADDPVA
GDGSELSRTL FVATEPRDVR GLGTLSIRYD SDAMEDVWAY IPAVRRVRRL SGGAWMDPVG
STDQLQDDIE IFNAQPSWYD EYRLVERRWV LAAANGRADN VNGNADDVDA QYPLFDLSQS
PYWNVNFDRY EPREVWVIEA IPPSEHPYSR KVVYMDTQYP RLHYGEAYNQ AGDFWKFLQF
NSTPGEGDDG FRDIRTNAGV VIDFLRNRAT IFIPDHSEWS TNTPGFTEDD IGVSTLRSVA
R