Gene Emin_1390 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEmin_1390 
Symbol 
ID6262862 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameElusimicrobium minutum Pei191 
KingdomBacteria 
Replicon accessionNC_010644 
Strand
Start bp1491357 
End bp1492346 
Gene Length990 bp 
Protein Length329 aa 
Translation table11 
GC content41% 
IMG OID642611870 
ProductDNA-directed RNA polymerase, alpha subunit 
Protein accessionYP_001876276 
Protein GI187251794 
COG category[K] Transcription 
COG ID[COG0202] DNA-directed RNA polymerase, alpha subunit/40 kD subunit 
TIGRFAM ID[TIGR02027] DNA-directed RNA polymerase, alpha subunit, bacterial and chloroplast-type 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones74 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTTTTA ATCAGTTAGT TTTACCGACA AAAATTCAGT TGGATGAAAA AACAGTAACC 
CCGTTTTACG GTCGCATAAT AGCGGAACCG TATGAAAGCG GCTATGGTCA TACTGTTGGA
AATTCACTTA GAAGAATTTT ACTTTCGGGA CTTGACGGTT CGGCTGTAAC GGCTGTCAGA
GTTAGGGGTG CTGTACACGA ATACAGCACA ATCCCCAACG TTAAGGAAGA TGTTATCAAT
ATTTTGCTCA ACCTTAAAAA ATTAAGAGTT AAACTTGAAG GGAAAAACAG AGAATATGTT
TATTTAACTG CTTCTAAACC CGGCAAAGTA ACGGCAAAAG ATATTGCGGA AGTATCCGGC
GTTGAAATCA TTAATAAGGA TTTGGAAATC GCTAATTTAG AACAGGGCGG CAAACTTGAG
CTTGAAATTG AAATTTCACA AGGCAAAGGT TATGTTCCTG CGGAAGATTT AAGCAAAATC
CAAAGACCTG CGGGCTTTAT TCCCGTGGAC GCAATTTTCT CACCCATTCT TAAGGTTCAC
TATGATGTTG AACCCGCGCG CGTAGGGCAG AAAACGGATT ATGACAGGCT TGTTATACAA
ATAACCACAG ACGGTACTCT TGAACCTGCG AAGGCTTTAC ATAAAGCGGC GGTCCTTCTT
TCACAATCAC TTCATATTTT CACGATTGAA GGTGAAGAAG TTAACGCTGC GGCGCCTGAA
ACTGAGCCTT TGTCAACCAC AGGCAGCGTA AGCGGCGTAA GCGCGGTTAA CAGCAAAGTT
GAAGAACTTT TAAACCAGTC TGTTGAGTTT ATTGAACTTT CATCACGTTC AATTAACTGC
CTTAAATCAG AAGGCGTAAA CACGGTTAAA GATTTGGTAA GCAAGACTGA AGATGATCTC
AAAATGATAA AGAACTTTGG TACTCGTTCA CTTGATGAGG TTAAAGAAAG ACTTGCGGAA
ATGAATCTTT CCCTCGGTAT GAAATTTTAA
 
Protein sequence
MAFNQLVLPT KIQLDEKTVT PFYGRIIAEP YESGYGHTVG NSLRRILLSG LDGSAVTAVR 
VRGAVHEYST IPNVKEDVIN ILLNLKKLRV KLEGKNREYV YLTASKPGKV TAKDIAEVSG
VEIINKDLEI ANLEQGGKLE LEIEISQGKG YVPAEDLSKI QRPAGFIPVD AIFSPILKVH
YDVEPARVGQ KTDYDRLVIQ ITTDGTLEPA KALHKAAVLL SQSLHIFTIE GEEVNAAAPE
TEPLSTTGSV SGVSAVNSKV EELLNQSVEF IELSSRSINC LKSEGVNTVK DLVSKTEDDL
KMIKNFGTRS LDEVKERLAE MNLSLGMKF