Gene Anae109_1848 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAnae109_1848 
Symbol 
ID5376860 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaeromyxobacter sp. Fw109-5 
KingdomBacteria 
Replicon accessionNC_009675 
Strand
Start bp2106396 
End bp2107319 
Gene Length924 bp 
Protein Length307 aa 
Translation table11 
GC content66% 
IMG OID640843356 
ProductRNA polymerase sigma factor RpoH 
Protein accessionYP_001379035 
Protein GI153004710 
COG category[K] Transcription 
COG ID[COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) 
TIGRFAM ID[TIGR02392] alternative sigma factor RpoH
[TIGR02937] RNA polymerase sigma factor, sigma-70 family 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.370084 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones49 
Fosmid unclonability p-value0.912724 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCGCA TCGCGATCCC GACGACCACC AGCTCCCTCG AGCTGTATCT ATCGGAGATC 
AATCGCTTTC CTCTCCTGAC GGTGGACGAG GAGCAGCGCC TCGCGCGGCT GTTCCGGGAC
GAGGGCGACA CGCGCGCGGC GCACCGGCTG GTCACGGCCA ACCTGCGCTT CGTCGTGAAG
GTGTCCTACG AGTACCGCTC CTACGGCTTC AAGATGGCCG ACCTCATCCA GGAGGGGAAC
ATCGGCCTGA TGAAGGCGGT CCAGAAGTTC GACCCGGACA AGGGGATCCG ACTCATCTCC
TACGCCGTCT GGTGGATCCG GGCGTACATC CAGAACTACA TCCTCAAGTC GTGGTCGCTC
GTGAAGCTGG GGACGACCCA GGCACAGCGG AAGCTGTTCT TCTCTCTCGC GCGGACCAAG
CGCGAGCTCG ACAAGATGTC GGTCGAGCAC GGCCGCGACT CCGACGCGCA GGACAAGGGC
AAGATCGCGA AGAAGCTCCG CGTGAAGCCG GCCGAGGTCG AGGAGATGGA GCAGCGGATG
GACGGCCGCG ATCTCTCCCT CGACGCGCCG ATGGGCGACG ACGGCGGCTA CTCCCACATC
GACTTCGTGG CCGCGAAGTC GGCGGCGCAG GACGACGAGC TCTCCGACGC CGAGGAGCAG
CAGATCGTCT CCGCCAAGGT CACCGCCGCG CTCGCGCGGC TCGACCAGCG CGAGCGGTAC
ATCATCGAGC AGCGCGTCAT GAGCGATCGC CCGCTCACGT TGAAGGAGCT GGGCGAGCAC
TTCGGGTTCT CCCGCGAGCG CGCGCGCCAG CTCGAGATCC GCGCCAAGGA GAAGCTGAAG
CAGGAGCTGC ACGCGCTCGC GGCGGAGATC GACTACCCGA CGGACGGCAG ACCGATCGAC
ATCGACGACG CGGCGCTGGC GTAG
 
Protein sequence
MKRIAIPTTT SSLELYLSEI NRFPLLTVDE EQRLARLFRD EGDTRAAHRL VTANLRFVVK 
VSYEYRSYGF KMADLIQEGN IGLMKAVQKF DPDKGIRLIS YAVWWIRAYI QNYILKSWSL
VKLGTTQAQR KLFFSLARTK RELDKMSVEH GRDSDAQDKG KIAKKLRVKP AEVEEMEQRM
DGRDLSLDAP MGDDGGYSHI DFVAAKSAAQ DDELSDAEEQ QIVSAKVTAA LARLDQRERY
IIEQRVMSDR PLTLKELGEH FGFSRERARQ LEIRAKEKLK QELHALAAEI DYPTDGRPID
IDDAALA