Gene Anae109_4037 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAnae109_4037 
Symbol 
ID5375442 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaeromyxobacter sp. Fw109-5 
KingdomBacteria 
Replicon accessionNC_009675 
Strand
Start bp4727443 
End bp4728432 
Gene Length990 bp 
Protein Length329 aa 
Translation table11 
GC content76% 
IMG OID640845564 
ProductRNA polymerase factor sigma-70 
Protein accessionYP_001381199 
Protein GI153006874 
COG category[K] Transcription 
COG ID[COG1595] DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog 
TIGRFAM ID[TIGR02937] RNA polymerase sigma factor, sigma-70 family
[TIGR02960] RNA polymerase sigma-70 factor, TIGR02960 family 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones61 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCGGAC CCGCCGCTCA CCTCGAGGAG CACCGCGCCG CGCTCACCGG GCACTGCTAC 
CGGATGCTCG GCTCGGTGGT GGACGCCGAC GACGCCGTCC AGGAGACGAT GGTCAGGGCC
TGGAGAGGCC TCGACCGGTT CGACGGGCGC TCGTCGCTGC GCACCTGGCT CTACCGCATC
GCGACGAACG TGTGCCTCGA CGCGCTGGCC GACCGCTCGC GGCGGGAGCG CCCGGTGGAG
GAGGGGCCGG CCGGATCGGT GGACGACCCG CTGGAGACGC GTCCACGTTC CCACTGGCTC
GAGCCGGTGC CCGACGCGCG GGCCGTGCCG GCGGACGGGG ACCCGGCGGA GCGGGTGGTG
CTCCGGCAGA GCATCCGGCT CGCCTTCGTG GCGGCGCTCC AGCACCTCCC GCCCCGGCAG
CGCGCCGCCC TGCTGCTCAC CGACGTGCTC GGCTGGTCCG CCGCGGAGGT CGCGCAGGGC
CTCGACACCT CCGTCGCCGC CGTGAACAGC GCGCTGCAGC GGGCGCGCGC CACGCTCGCC
ACGCGCGACC TGGGGGACGA CCCGACGGGC ACGCTCTCCG ACGCGCAGGC AGCGCTGCTC
GACCGTTACG TCGCGGCGTT CGAGCGCTAC GACGTGGACG GGCTGACCGC GCTCCTGCAC
GAGGACGCGG CCATGTCGAT GCCGCCGCAT GCCCTGTGGC TGCGCGGGAG GGAGGCGGTC
CGTGCCTGGC TGCTCGGACG CGGGCTGGGT TGCCGTGGCT CGCGGCTGCT CCCGACCGCC
GCGTGCGGCG CCCCCGCCTT CGCCCAGTAC CGTCCCGCGC CGCAGGGGGG GCACCGGGCG
TGGGGGCTCA TCGTGCTGGA CCTCGCCGGC GACCGCATCT CGGGGTGGAC CACCTTCCTC
GACACCGAGT CGCTCTTTCC GAGGTTCGAG CTCCCGCTGG AGCTCCCGCC GGTCGACGCG
GCCTCGAGCT CGCCGAGCTC GCCCGCCTGA
 
Protein sequence
MPGPAAHLEE HRAALTGHCY RMLGSVVDAD DAVQETMVRA WRGLDRFDGR SSLRTWLYRI 
ATNVCLDALA DRSRRERPVE EGPAGSVDDP LETRPRSHWL EPVPDARAVP ADGDPAERVV
LRQSIRLAFV AALQHLPPRQ RAALLLTDVL GWSAAEVAQG LDTSVAAVNS ALQRARATLA
TRDLGDDPTG TLSDAQAALL DRYVAAFERY DVDGLTALLH EDAAMSMPPH ALWLRGREAV
RAWLLGRGLG CRGSRLLPTA ACGAPAFAQY RPAPQGGHRA WGLIVLDLAG DRISGWTTFL
DTESLFPRFE LPLELPPVDA ASSSPSSPA