Gene Mlg_0483 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_0483 
Symbol 
ID4268351 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp530140 
End bp531135 
Gene Length996 bp 
Protein Length331 aa 
Translation table11 
GC content66% 
IMG OID638125223 
ProductDNA-directed RNA polymerase subunit alpha 
Protein accessionYP_741327 
Protein GI114319644 
COG category[K] Transcription 
COG ID[COG0202] DNA-directed RNA polymerase, alpha subunit/40 kD subunit 
TIGRFAM ID[TIGR02027] DNA-directed RNA polymerase, alpha subunit, bacterial and chloroplast-type 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGGGGA CCTTCAGGGA TTTCCTCAAG CCGCGCACCG TCGACATCCA GGAACAGGGT 
GAGCGACGCG CGAAGATTGT GCTGGAGCCC CTCGAGCGGG GCTTCGGCCA CACGCTGGGC
AACGCCCTGC GCCGCGTGTT GCTGTCCTCC ATGCCGGGCA GCGCCGTTGT CCAGGCAGAG
ATCGAGGGTG TCGAGCACGA GTACAGCAGC ATGGAGGGGG TTCAGGAAGA TGTTGTCGAC
ATCCTGCTGA ACCTCAAGAG CCTGGCCGTG CGGATGCACG ATCGGGACGA GGCCGAGCTC
ACGGTGTCGG TGCAGGGGCC TGGGCCGGTC ACCGCGGGCG ATATCCAGAC CGCCCACGAT
GTCGAGGTCA AGAACCCGGA GCTCCTCATT TGCACGCTCA CCAAGGCGGT GGCCTTCAAC
GCCAAGCTGA TGGTCGCCCG CGGGCGGGGC TACGAGGCCG CGACCCAGCG TGATGGGGAC
GAGGACCGGG TCATCGGGCG CCTGCAGCTC GACGCCAGCT ACAGCCCGGT GAAGCGGGTG
GCCTACACGG TGGAGAGCGC CCGCGTTGAG CAGCGGACCA ACCTGGACAA GCTGGTCCTC
GATGTGGAGA CCAACGGTGT GCTGGAGCCG GAGGAGGCGG TGCGTTTCGC CGCCGGCCTG
CTGCGCGATC AGCTCTCGGT GTTCGTGGAC CTGGAAGGCG GCGAGTTCGA GGCCGAGCAG
GAGGAGCAGG AGCCCGACGT GGATCCGATC CTGCTGCGTC CGATCGATGA GCTGGAGCTG
ACCGTCCGGT CCGCCAACTG CCTCAAGGCC GAGAGCATCC ACTACGTGGG TGACCTGGTG
CAGCGCACTG AGGTCGAGCT GTTGAAGACG CCGAATCTGG GCAAGAAGTC CCTGACCGAA
ATCAAGGAGA CACTGGCCTC CCACGGCCTG TCCCTTGGTA TGAGGCTGGA AAACTGGCCG
CCGGCCGGTC TGGGCGAAGA TCGCGTCGTG GGCTGA
 
Protein sequence
MQGTFRDFLK PRTVDIQEQG ERRAKIVLEP LERGFGHTLG NALRRVLLSS MPGSAVVQAE 
IEGVEHEYSS MEGVQEDVVD ILLNLKSLAV RMHDRDEAEL TVSVQGPGPV TAGDIQTAHD
VEVKNPELLI CTLTKAVAFN AKLMVARGRG YEAATQRDGD EDRVIGRLQL DASYSPVKRV
AYTVESARVE QRTNLDKLVL DVETNGVLEP EEAVRFAAGL LRDQLSVFVD LEGGEFEAEQ
EEQEPDVDPI LLRPIDELEL TVRSANCLKA ESIHYVGDLV QRTEVELLKT PNLGKKSLTE
IKETLASHGL SLGMRLENWP PAGLGEDRVV G