Gene Mlg_1139 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_1139 
Symbol 
ID4269634 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp1332466 
End bp1333476 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content61% 
IMG OID638125888 
Producthypothetical protein 
Protein accessionYP_741978 
Protein GI114320295 
COG category[R] General function prediction only 
COG ID[COG0790] FOG: TPR repeat, SEL1 subfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value0.882872 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCTGCA CAAGAACAAT GGTTTTCCTC TTGATCGTTG GCTTATGGGC CGGGACGGCT 
CAAGCCAACA AAGGCGAAGA CACAACATCC CCAGAGCTCG ATGCCGAGCA ACAACAGGCC
AAGGAAGAAG GCATGCGCCT GTACGGCATC CGCTGGCGCA GCGTGGCCAT CCACCACCTG
GAAAAGGCCG CCGAGGCCGG CGATGTTGAG TCCATGTACA CCCTTGGCGA GATCTACCGC
TTTATGGACC GTGGCATGTC CCACGAGGCC ATCGACTGGT ACCACCGCGC GGCGGAGGGC
GGGGATCCCT ACGCCATGCT TCGTCTGAAT TGGGGCATGA TCTGCGAGCT GGCCGACATC
TGCCCCGAAG AGCATGACAC CTGGGCAGAA ATGGCCCTGG GCCAGGAACT CCCCAAAGCC
GAGGAAGGGG ATCCGGATGC CATGCTTGCA CTGTATTCGA TCTATGTTGC GCTGGAGGAG
GTGGAAGAGG GTCGGAACTG GTTACGCAAC GCTGCCAGGG CGGGCCTCCC ACAGGCACAA
GACCTGTGGG CGAGTCGTAT TCAGGAGCGC TCCGGCGAAT GGCCCCCGCC GCTGGAAGAC
GTCAAGGCCG CCGAGCCCTG GTTCCGCAAG GCCGCCGAGC AGGGCTACGC CCCGGGGATG
TACAACCTGT CCCTAGCCTT GCGGGATCAG GAGCGGTATA ACGAAGACTG GAAATGGACG
AAAAAAAGTT CCCGACATGG CCATATCAGC GGTCGTCTCG CCGTTGGCTG GTGCTACCTG
GATAATACCT GGGCGGATTT CTGCCCGGAC GACGCAGATG ACACGGTCAA GGGTTGGGCC
ATACTTCACG CGGTTTATGA AGAGACGCGA GATAGCACGG CCGAGGGCAT TCTTGGGCGA
GAACGCGACC GCATGACCGA AGATGAAATC GCCGAAGCCG AAGAACTCGC CGAGGAGTGG
CTGAACCGCG AGCCCCCGCT GTCCTACTTC CCGCCCAAGT ACGGCCTGTA G
 
Protein sequence
MPCTRTMVFL LIVGLWAGTA QANKGEDTTS PELDAEQQQA KEEGMRLYGI RWRSVAIHHL 
EKAAEAGDVE SMYTLGEIYR FMDRGMSHEA IDWYHRAAEG GDPYAMLRLN WGMICELADI
CPEEHDTWAE MALGQELPKA EEGDPDAMLA LYSIYVALEE VEEGRNWLRN AARAGLPQAQ
DLWASRIQER SGEWPPPLED VKAAEPWFRK AAEQGYAPGM YNLSLALRDQ ERYNEDWKWT
KKSSRHGHIS GRLAVGWCYL DNTWADFCPD DADDTVKGWA ILHAVYEETR DSTAEGILGR
ERDRMTEDEI AEAEELAEEW LNREPPLSYF PPKYGL