Gene Apre_1642 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApre_1642 
Symbol 
ID8398454 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerococcus prevotii DSM 20548 
KingdomBacteria 
Replicon accessionNC_013171 
Strand
Start bp1784272 
End bp1785303 
Gene Length1032 bp 
Protein Length343 aa 
Translation table11 
GC content38% 
IMG OID644996006 
ProductRNA polymerase, sigma 32 subunit, RpoH 
Protein accessionYP_003153384 
Protein GI257067128 
COG category[K] Transcription 
COG ID[COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) 
TIGRFAM ID[TIGR02937] RNA polymerase sigma factor, sigma-70 family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000192835 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGTGATA AAATGGATAA AATAAATTTA AAAGATAAAA AAACCCTAGA GAAATTACAA 
GAGTATCTCA ACTCAGAGGA CTTGACCGAA GAGGAGATTA TCTCAATTCT TGAAGGTTTA
ACAGAAGAAG AAAAAGACGA GATAATGGAT ATTATATCAG ACGAAGTGGA TGACGATGAT
GAAGATGATG ACTTTGAAGA AAGAAAGACA AATGTGTCAA AAGCTTCCAT CATGCCTATA
AGCCGCCGTG ATATGATAGA GCTATCAGAC CTTACTAATG AACAAATAGT GGAACAATTC
CAAATAGGAA ATCAGAATGC CCTAGCAGCC CTTGTAGAAA AAAACCAAGG ACTTGTTAGA
AGTAGGGCCT CATATTTCTT TAGATCTCAC GGAAACGATC TAGACCTAGA GGACTTAGTC
CAATCAGGTA TGCTCGGTAT GATTCGTGCG GCAGAAAAGT TCGACCTATC CCTAGGCTAT
AAGTTTACAA CCTATGCCTA TAAGTGGATC GATAAGGCCA TAAGAAAGGC CATAAACAAG
GAAGGCCACA CTATAAGAAT ACCTGCCGGT AAATACCTAA AACTTAATAA GCTTAAGCAA
ATTCTTAAAG CAAATCCAGA AGCAAGCGAT GAGGAGCTTT ATAGGATTTT GGAAAAGGAG
GGAATCGATA AGAAACAAGC AGACGACCTT TTCCTAATAA ATAGAAACCA AGTAAACTCC
ACATCCCTTA ACATCAACTT GGACAGTGAG GATTCGACAG GTGATGAGCT TATGGATATG
GTAGGAGATG AGTCAACTCC AGTCGATATG CTAATACTCG AAAAAGACAT GGAAAACTTC
CTCCTTAAGG CCCTAGACCA ACTAACAGAT AGGGAAAAGC AAATCATAAT ATTTAGATAT
GGACTAGATA ACGAAAAACC TAAGACCCTT GAAGAAATAG GTAAAATCTA CGACTTATCT
AGAGAAAGAA TCAGACAAAT TGAAAATCAA GCCTTGGGCA AACTGAAAGA ATTTTCTGAA
AGAGAAGAAT AA
 
Protein sequence
MGDKMDKINL KDKKTLEKLQ EYLNSEDLTE EEIISILEGL TEEEKDEIMD IISDEVDDDD 
EDDDFEERKT NVSKASIMPI SRRDMIELSD LTNEQIVEQF QIGNQNALAA LVEKNQGLVR
SRASYFFRSH GNDLDLEDLV QSGMLGMIRA AEKFDLSLGY KFTTYAYKWI DKAIRKAINK
EGHTIRIPAG KYLKLNKLKQ ILKANPEASD EELYRILEKE GIDKKQADDL FLINRNQVNS
TSLNINLDSE DSTGDELMDM VGDESTPVDM LILEKDMENF LLKALDQLTD REKQIIIFRY
GLDNEKPKTL EEIGKIYDLS RERIRQIENQ ALGKLKEFSE REE