Gene Mlg_0196 details

Gene Information

Locus tag: Mlg_0196
Symbol:
ID: 4269642
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 227793
End bp: 229226
Gene Length: 1434 bp
Protein Length: 477 aa
Translation table: 11
GC content: 71%
IMG OID: 638124920
Product: hypothetical protein
Protein accession: YP_741041
Protein GI: 114319358
COG category: [S] Function unknown
COG ID: [COG2170] Uncharacterized conserved protein
TIGRFAM ID:
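
The coordinate and length fields above are related by simple arithmetic, and the following minimal Python sketch checks that they agree. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. Neither convention is stated in the record, but both are consistent with the listed values. The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 227793
END_BP = 229226
GENE_LENGTH_BP = 1434
PROTEIN_LENGTH_AA = 477


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a complete CDS: number of codons minus one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 229226 - 227793 + 1 gives 1434 bp, and 1434 / 3 - 1 gives 477 aa, matching the listed Gene Length and Protein Length.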


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0710624
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 48
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGCCAGG AGATCGACGA GACCCACTTC ACGCCGGAGC ACTTCCACCG CTTTTCCCGC 
CGCCTCAAGG AGGAGACCGC CCTGCTCAAG CAGTGGCTGC AGGAGGGCCG CTTCCGGGAC
GAGCCGGCAC GCATCGGGCT GGAGCTGGAG GCCTGGCTGG TGGATGACCA GGGGTTGCCG
ACCCCACGCA ACGAGGAGGT CATCCGGCAG TGCAATGACC CGCACCTGGA GACCGAGCTG
GCGCGCTACA ACCTGGAGCT CAACACCGAT CCCGTCGCCC TGCCGGGACG GCCGTTCAGC
GAGCTGGCCG GGGTGCTGGA GCAGCGCTGG GCCCGGCTGC GCGAGGCCGC CCGGGGTCAG
CAGTGCCAGC CCACGCTGGT GGGCATCCTG CCCTCGGTGC GTGAGGCGGA TCTCTCGGTG
GCGGCCATGT CCGGCCTAAA GCGCTACCAG GCGCTGAACG AGCAGGTCCT GGCCCTGCGC
AACCGGCGCC CGATCCACCT GGACATCCGC GGCGAGGAGC ACATCACTCT GGACCACCCG
GACGTGATGC TGGAGTCCGC TGCCACCTCC CTGCAACTGC ATCTGCAGGT CAGTCCCGGC
AACGCCCACC GGCTGTTTAA CGCCGCGGTG GCCGCCTCCG CCGCCACCGT GGGCACCGCC
GCCAATTCAC CGCTGCTGTT CGACCACCTG CTCTGGGCCG AGACCCGCAT CCCGTTGTTC
GAACAGGCGG TCGCGGTGAC CCCCCTGCAC GGTGGCCATG CCGGCCCCCT GGCCCGGGTG
GGCTTCGGCA GCGGCTATGC CCGGGATGCC CTCTACGGCT GGTTTGTGGA GAACCGCCAG
CACTTCCCGG TGCTGCTGCC GGTGCTCCAG GACGCCCCGC CGGAGGACCT GGCCCACCTG
CGGCTGCACA ACGGCACCAT CTGGCGCTGG AACCGCCCGC TGGTGGAACC GCACGGCGGG
CACGACCCCC ACCTACGCAT TGAGCACCGG GTGATGGCCG CCGGGCCCAC CCTGGCCGAC
GTGGTGGCCA ACGCCGCCTT CTTCTATGGT CTGGTGCACG GGCTGCTGCT CAAGGAGCCG
GAATTGGAGG GGCGCCTGCC CTTCGTCACC GCCGAGGCCA ATTTCTACAG CGCCGCCCGC
CACGGCCTGG ACGCCACGGT CACCTGGCTG GGCCAGCGCC AGGGGCCGCT GAGGGAGCTC
ATCCTGGAGG AACTGCTGCC ACTGGCCCGG GTGGGCCTGG AGCACCAGGA CGTGGCCGGC
GACGAGATCC GCCACTGGCT GGGGCTCATC CGCGAGCGGG TGACCAGCGG CCGTACCGGG
GCCGCTTGGC AGCGCGCTTG GTGGCACCGC CACGGCCCTA ACGCCCCGGC ACTGGTACAG
GCAATGCTCC GCCACCAGCA GACCGGGGAA CCCGTCCACC GCTGGGCGCT CTAG
 
Protein sequence
MGQEIDETHF TPEHFHRFSR RLKEETALLK QWLQEGRFRD EPARIGLELE AWLVDDQGLP 
TPRNEEVIRQ CNDPHLETEL ARYNLELNTD PVALPGRPFS ELAGVLEQRW ARLREAARGQ
QCQPTLVGIL PSVREADLSV AAMSGLKRYQ ALNEQVLALR NRRPIHLDIR GEEHITLDHP
DVMLESAATS LQLHLQVSPG NAHRLFNAAV AASAATVGTA ANSPLLFDHL LWAETRIPLF
EQAVAVTPLH GGHAGPLARV GFGSGYARDA LYGWFVENRQ HFPVLLPVLQ DAPPEDLAHL
RLHNGTIWRW NRPLVEPHGG HDPHLRIEHR VMAAGPTLAD VVANAAFFYG LVHGLLLKEP
ELEGRLPFVT AEANFYSAAR HGLDATVTWL GQRQGPLREL ILEELLPLAR VGLEHQDVAG
DEIRHWLGLI RERVTSGRTG AAWQRAWWHR HGPNAPALVQ AMLRHQQTGE PVHRWAL
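
The gene and protein sequences above can be cross-checked by translating the nucleotide sequence. The sketch below is one way to do that in plain Python, not the method used to produce this record. It assumes the amino acid assignments of NCBI translation table 11, which match the standard code for sense codons, and it ignores the 10-base spacing and line breaks used in the listing. The helper names `clean`, `translate`, and `gc_fraction` are illustrative.

```python
# Amino acid assignments for NCBI translation table 11, in TCAG codon order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    # First row of the gene sequence listed in this record.
    first_row = "ATGGGCCAGG AGATCGACGA GACCCACTTC ACGCCGGAGC ACTTCCACCG CTTTTCCCGC"
    print(translate(first_row))  # MGQEIDETHFTPEHFHRFSR
```

The 20 residues obtained from the first row match the first two blocks of the listed protein sequence (MGQEIDETHF TPEHFHRFSR). Passing the full gene sequence to `translate` and `gc_fraction` allows the same comparison against the complete protein sequence and the reported GC content.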