Gene Apre_1135 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApre_1135 
Symbol 
ID8397923 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerococcus prevotii DSM 20548 
KingdomBacteria 
Replicon accessionNC_013171 
Strand
Start bp1219236 
End bp1221044 
Gene Length1809 bp 
Protein Length602 aa 
Translation table11 
GC content38% 
IMG OID644995481 
ProductRNA polymerase, sigma 70 subunit, RpoD subfamily 
Protein accessionYP_003152882 
Protein GI257066626 
COG category[K] Transcription 
COG ID[COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) 
TIGRFAM ID[TIGR02393] RNA polymerase sigma factor RpoD, C-terminal domain
[TIGR02937] RNA polymerase sigma factor, sigma-70 family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00130789 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAAGCG ATGACAGTAA GTTGCTCGAT GATGATGTTC TAAAAGAAAT AATAGAAGAA 
TTTCAGTCTA TGAGCGAGTC TCAAGATGGA TTGATTACCT ATTCTCAGAT AAATACATTC
GAAAACTTCA AGGAAATGGA AGATGAGAGC AAGAAGTTTG TTATAAATCA GATTATAAGT
TTAGGCATAG AAATCGTTGA AAAATCTGAA ACAGTCCTTG AAGAAGATGA AGAAGCGGAT
GATTTAGATG ATGAAATAGA TGATGACGAC GAAGACGATA TCGACGAGAA GAAGATAGAA
GAAGATATCA AGAAATCTGT GGATATTATG GCTGGAGTCA AGGTTGATGA CCCTGTAAAG
ATGTATCTTA AGGAAATAGG CAAGGTAGAC CTTCTTACAG CGACAGAAGA AATAATACTC
GCTCGTAGGA TGGAAATGGG AGAGATTGCA AGGAAGAAAC TCGAAGGAGT TAAGTTTAAC
AAGCAAAACA AGACCAAACT CGCCAATGAT ATCTTCTTCG GAGATCTAGC CAAGATACTC
AAAGTATCCA ATGAAAAGAT TGTAGAAAAT GGAAGTCTTG ATAAGAAAAG CTACGAAGAG
ATAAGAGGGC TTATCAGCAT GGGCGATCTT TTTAGAGAAA TTATGGACAA AGATATGGAT
AAAGAAGCCC TAGAGGAACT CAAAGCAAGG GTTTTGATTT GTAATATAGC AGAAAATGCT
GTAAGCGACA ATGTCTTAGA TAGAAGTGAG GCCAATTCTC TTTGTGATCT TTTTGATTCA
TTTGACCATA TGCTTAACAT TACCGAGTCA GATGAGATAG TTAATTTAAT CTATTTATCA
TTTAATGATA TCCTATCAAA GGCTGCCAAT AAGGAACATA TCAAGACAAA CGATAGTATG
ATTCTAGAAG ATTTCATAAT TACTAGCCAG AGGGCTAAGA AGATTTCTGA AAACATAGCC
TTAGATAAAG ATGAGCTAAT GGAGATAGAA GAAACTATCG CCCTAAGAGA TCTTGCCTAC
AAAGTGATAA ATGACGAAGA CTTAAGCGAT GAAGATATAC TAGATTTAAG AAGAGCTGTT
CTTAATTCTA AAAGAGCTAA AAAGAAACTT GCAGAAACTA ACCTTAGACT TGTAGTTTCT
ATTGCCAAAA AATATGTCGG TCGTGGCATG AGCTTTTTAG ACCTTATCCA AGAAGGAAAC
ATGGGACTTA TGAAGGCAGT TGATAAGTAC GACTACAACA GGGGCTTCAA GTTCTCAACT
TATGCGACCT GGTGGATTAG ACAGGCTATT ACTCGTGCTA TAGCAGATCA GGCAAGGACT
ATAAGAATTC CAGTTCACAT GGTAGAAACA ATAAACAAAC TCGTAAGAAT CCAAAGGCAG
CTAGTTCAGG ATTTGGGTAG GGATCCTTCC AATGAAGAGA TTGCCGAGCA GATGGGACTA
GAAGTAGAGA AGGTTCAAGA GATAAGAAAA ATATCCCAAG AACCAGTTTC CTTGGAAACT
CCAATAGGAG AAGAGGACGA CTCCCACTTA GGAGATTTTA TAGAAGATGA TAGTGCGATC
GACCCTGGTG AAGCTGCAAA CTACACTATG CTCCGTGAGC AATTAAACGA TGTTTTATCT
TGCTTGGGTG CTAGGGAAAA ACGTGTCCTC CAATTAAGGT TTGGCCTAAT CGATGGAACA
CCAAGGACCC TCGAAGAAGT AGGCAAGGAA TTTGACGTAA CTAGGGAGAG GATAAGACAA
ATAGAGGCCA AGGCCCTAAG AAAGCTTAAA TCTCCGAACA AGAGTGAATT ATTGAAAGAT
TTTTTATAA
 
Protein sequence
MTSDDSKLLD DDVLKEIIEE FQSMSESQDG LITYSQINTF ENFKEMEDES KKFVINQIIS 
LGIEIVEKSE TVLEEDEEAD DLDDEIDDDD EDDIDEKKIE EDIKKSVDIM AGVKVDDPVK
MYLKEIGKVD LLTATEEIIL ARRMEMGEIA RKKLEGVKFN KQNKTKLAND IFFGDLAKIL
KVSNEKIVEN GSLDKKSYEE IRGLISMGDL FREIMDKDMD KEALEELKAR VLICNIAENA
VSDNVLDRSE ANSLCDLFDS FDHMLNITES DEIVNLIYLS FNDILSKAAN KEHIKTNDSM
ILEDFIITSQ RAKKISENIA LDKDELMEIE ETIALRDLAY KVINDEDLSD EDILDLRRAV
LNSKRAKKKL AETNLRLVVS IAKKYVGRGM SFLDLIQEGN MGLMKAVDKY DYNRGFKFST
YATWWIRQAI TRAIADQART IRIPVHMVET INKLVRIQRQ LVQDLGRDPS NEEIAEQMGL
EVEKVQEIRK ISQEPVSLET PIGEEDDSHL GDFIEDDSAI DPGEAANYTM LREQLNDVLS
CLGAREKRVL QLRFGLIDGT PRTLEEVGKE FDVTRERIRQ IEAKALRKLK SPNKSELLKD
FL