Gene Apre_1547 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApre_1547 
Symbol 
ID8398359 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerococcus prevotii DSM 20548 
KingdomBacteria 
Replicon accessionNC_013171 
Strand
Start bp1682522 
End bp1683478 
Gene Length957 bp 
Protein Length318 aa 
Translation table11 
GC content36% 
IMG OID644995911 
ProductDNA-directed RNA polymerase subunit alpha 
Protein accessionYP_003153289 
Protein GI257067033 
COG category[K] Transcription 
COG ID[COG0202] DNA-directed RNA polymerase, alpha subunit/40 kD subunit 
TIGRFAM ID[TIGR02027] DNA-directed RNA polymerase, alpha subunit, bacterial and chloroplast-type 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATCGAAA AATTAGATAC AAATATACAG ATATTGGATA TAGACGAAGA AGAAAATTAC 
GGTAAATTCG CCCTATATCC ACTTGAGCGA GGATATGGTA CAACCATTGG AAACAGTATG
AGAAGGGTGC TTTTATCATC CTTACCTGGT TCTAGCGTCT CAAAAATACT TATAGAAGGA
GTGCTTCACG AATTCTCCAC TATAGATGGA GTAGTAGAAG ATGTTCCTGA AATAGTTCTA
AACATTAAGG GTCTAGACGT TACAAAACAT GTAGATGAAG ATGTAACATT GTTTTTAGAC
ATTGAAGGAC CAAAGATTGT AACAGCAAAA GATATCAAAG CAGATAGTTC CGTAGATATA
GCAAATCCTG ACCACTACAT CGCAACAGTT AACGAGAAGT CAAGACTATT TATAGCGATG
GATGTTACAG ATGGTAAGGG TTATAGGGTA TCTGATGATA ACAAGAAAGA AAGCGACCCA
ATCGGTGCAA TCGCAATTGA TTCATCATTT ACTCCAGTTG AGAAAGTAAA CTTTACTGTA
GAAAATACAA GAGTAGGCGA ATCAACCGAC TATGACAAAC TCGTTATGGA AGTTTGGACA
AATGGAACTA TTACACCACA AGAAGCCCTT GCAGAAGGAT CATCAATCTT AATAGAAAAC
TTCTCTTTCT TCAACGAATT GCCTAACCAA CAATTCCCAC CTGAAGTGGA AGAAGAAGAA
ATAGAAGAAG TAGAAGAAGA AGATAGTCTT TCAGAAGATT TGGCAATGAC AATAGAAGAA
TTAGACCTAA GTCTAAGATC ATTTAATTGT CTAAAAAGAG CAGGCTTCGA CAGAGTTGGC
GATATAATCA AGGTTAGCGA ATCTGAGCTA AAAACAATCA AGAACTTCGG TAAAAAGTCA
CTCACAGAAG TAATAGAAAA GCTAGACGAG TTAGGTCTAA GCTTAAAAGA TGAATAG
 
Protein sequence
MIEKLDTNIQ ILDIDEEENY GKFALYPLER GYGTTIGNSM RRVLLSSLPG SSVSKILIEG 
VLHEFSTIDG VVEDVPEIVL NIKGLDVTKH VDEDVTLFLD IEGPKIVTAK DIKADSSVDI
ANPDHYIATV NEKSRLFIAM DVTDGKGYRV SDDNKKESDP IGAIAIDSSF TPVEKVNFTV
ENTRVGESTD YDKLVMEVWT NGTITPQEAL AEGSSILIEN FSFFNELPNQ QFPPEVEEEE
IEEVEEEDSL SEDLAMTIEE LDLSLRSFNC LKRAGFDRVG DIIKVSESEL KTIKNFGKKS
LTEVIEKLDE LGLSLKDE