Gene Mlg_1314 details

Gene Information

Locus tag: Mlg_1314
Symbol:
ID: 4268653
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 1515713
End bp: 1516684
Gene Length: 972 bp
Protein Length: 323 aa
Translation table: 11
GC content: 67%
IMG OID: 638126067
Product: UspA domain-containing protein
Protein accession: YP_742153
Protein GI: 114320470
COG category: [T] Signal transduction mechanisms
COG ID: [COG0589] Universal stress protein UspA and related nucleotide-binding proteins
TIGRFAM ID:
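
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the listed gene length of 972 bp agrees with that reading) and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical helpers, not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the terminal stop codon


length = gene_length_bp(1515713, 1516684)
print(length, protein_length_aa(length))  # prints "972 323"
```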


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.858313
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 39
Fosmid unclonability p-value: 0.823385
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGACACGCA TCTATGCAAC AATAAGCGGG GAGACGAACC GACACCCGCA GCATCCGGAC 
AAGAGCGAAC GGAACATGAC CATCAAGAAA ATCGTCTACG CCACCGACCT CTCCCGCCGT
GCGGACCGTG CCGGGCGCCG CGCACTCCAA CTGGCCAGCG ACCACCAGGC CGATCTGCTC
GCCCTCAGCG TGGTCGAGGC CGACCTCCAG GAGGAGAGCC TGGTCCGGCT GATGCGGGGC
AGCCCGGAGG AGGTGGCCCG GCAGCTGGTC GAGGGCACCG ACAAGGCCCT GGCCGAGCAT
ATGGAGAAGT TGGGCGTGGT GGAGGGCGGT ACCGTGCAGA GCCGGGCCGT GCTCGGGCAC
GGGCACAAGA CCATTCTCAG CGAGGCCAAG GCCTTCGGTG CCGATCTCCT GGTCATCGGT
GCCCACGGCC ACCACCACCT GCGGGACATC TTCCTGGGCA CGACCGCGGA GAACCTGGTC
CGCAACACCG ACCGGCCCAT CCTGGTGGTG AAGAACGAGC CTCAGGGCCG TTACCGCAAG
GTGCTGGTGC CAGTGGATTT CTCTGAGCGG TCCCGGCATG CCCTGGAACT GGCGGTCTCC
TCCGTCGCGG ACGATGGCAC GGTGCAGGTG CTGCACGTGT TCAACACCGC GCCACTGGAC
CGCATCTACC GCACCGGTGC CGATGATGAG ACGGTGCGCC GGATCCACCA GCAGGCCATG
GCGGAGACCC AGCAGGACCT CTCCGCCTTC CTCGCCCAGG CGGAGGTGGA CCTGGACAAG
GTGGAATCGA CCATCCGGGT GGGTTACCCG CCGCTGGTGA TCGAAGAGGC CGCCAACGCC
CTGAACGCCG AACTGCTGGT CATGGGCACC CACGGCCGCA AGCACTGGCA GGACGTACTG
CTGGGGGGCG TGGCCCGCCG GGTGGTCAAC CAGGTGCGCT GCGACGTGCT GCTCTCCCGC
GGTCGCGCCT GA
 
Protein sequence
MTRIYATISG ETNRHPQHPD KSERNMTIKK IVYATDLSRR ADRAGRRALQ LASDHQADLL 
ALSVVEADLQ EESLVRLMRG SPEEVARQLV EGTDKALAEH MEKLGVVEGG TVQSRAVLGH
GHKTILSEAK AFGADLLVIG AHGHHHLRDI FLGTTAENLV RNTDRPILVV KNEPQGRYRK
VLVPVDFSER SRHALELAVS SVADDGTVQV LHVFNTAPLD RIYRTGADDE TVRRIHQQAM
AETQQDLSAF LAQAEVDLDK VESTIRVGYP PLVIEEAANA LNAELLVMGT HGRKHWQDVL
LGGVARRVVN QVRCDVLLSR GRA
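
The gene sequence begins with TTG while the protein begins with M. Under translation table 11 (the bacterial, archaeal and plant plastid code), TTG is one of the permitted start codons and is read as methionine when it initiates translation. The following is a minimal sketch of how the protein sequence and GC content could be recomputed from the gene sequence. The helper names (`clean`, `translate_cds`, `gc_fraction`) are hypothetical, and the start codon set is taken from the NCBI definition of table 11 rather than from this record.

```python
from itertools import product

# Codon-to-amino-acid assignments in NCBI order (bases T, C, A, G).
# Table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Start codons permitted by table 11 (assumption: NCBI genetic code 11).
START_CODONS_11 = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate an unspliced CDS, reading the initiating codon as M."""
    seq = clean(seq)
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    if codons[0] not in START_CODONS_11:
        raise ValueError(f"{codons[0]} is not a table 11 start codon")
    protein = ["M"]
    for codon in codons[1:]:
        aa = CODON_TABLE[codon]  # raises KeyError on ambiguous bases such as N
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases of the gene sequence listed above.
print(translate_cds("TTGACACGCA TCTATGCAAC AATAAGCGGG"))  # prints "MTRIYATISG"
```

Passing the full gene sequence to `translate_cds` and `gc_fraction` in the same way allows the listed protein sequence and GC content to be checked against the record.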