Gene Anae109_3801 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAnae109_3801 
Symbol 
ID5376514 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaeromyxobacter sp. Fw109-5 
KingdomBacteria 
Replicon accessionNC_009675 
Strand
Start bp4430003 
End bp4431586 
Gene Length1584 bp 
Protein Length527 aa 
Translation table11 
GC content75% 
IMG OID640845326 
ProductRNA-binding S1 domain-containing protein 
Protein accessionYP_001380964 
Protein GI153006639 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0539] Ribosomal protein S1 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.289517 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value0.106411 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGAGG AGAAGAAGCC CGAATCCCAG GGCCCGGTGG TGATCCGCAA GGGCCACGTG 
AAGCCGCCCC CTCCCGGCGC CAAGATCGAG GCGCCGCAGG AGGAGCGGCT CGAGCCCGCT
CCGCAGCCCC GCGACGCCCG CCCGCTCTGG CAGCGGGTCG CGGAGGAGCG GTCCGGTGCC
GGCCGTGCGG CGCCCTCCGG CGGCCCCGGC GGCCCCGGCG CCCGGCGCGG GCCGCCGCGC
GGCGAGCGTC CGCCGCGAGG AAACGATCGC GGCGAGCGCC GCGATCGGCC GCGCGAGGCC
CGCGGCCCCG AGGGCGAGCG GCCGCGCCCG CCCCCGCGCG CTCCGGAGCC GCCGCCGGCG
CCCGAGGTGG CTGCGCCCGA CGAGGGGTCC TTCGCGGACA TGCTCGCCAC GTCCGAGAGC
GGCGCGCCGC GCCGGCGCTT CCAGGTGGGC GAGAAGGTCG CCGGCAAGAT CATCCAGCTC
GGCCACGACG TCGCGTTCCT GGAGCTCGGC TCCGGCGTCG CCGAGGCGAT GATCGAGGTC
GCGGAGCTGA AGGACGCCGA CGGGAACGTC ACCGCGCGCG AAGGGGACAT CCTGGACGCG
GTGGTGATCC ACGCGGACGA CCGCGGCGTC ATCGTCTCGA AGGGACACGC TCGCGCCACG
CGCGATCACG CGCGCGAGGC GGTGATCGAG GCGGCCCAGA CCGGGCTGCC GGTGGAGGGG
CTCGTCAAGG CCGCGAACAA GGGCGGCCTC GAGGTCGAGG TGCACGGCGT CCGCGCCTTC
TGTCCCATGT CGCAGATCGA CGTGCGGTTC GTGGGAGACC CCTCGACGTT CGTCGGCCAG
AAGCTCCAGT TCAAGGTGCA GCGCGCCGAC GCGCGCGACT GCGTGCTCTC CCGCCGGGCG
CTCCTCGAGG AGGAGCGCGC CGAGAGGGCG CGCGCCACCC GCGAGCGGCT GGCTCCCGGC
GCGGTCTTCG ACGGCGTCGT GACGAGCGTG CAGGACTACG GCGCGTTCGT GGACATCGGC
GGAGTGGAGG GGCTCGTCCA CGTCTCCGAG CTGTCCTGGG ACCGCGTCTC GAAGCCGCAG
GACCTGCTCA CGGCCGGCGA CGCGGTGCAG GTGCAGGTGC TCCGCATCGA CGAGGACCCG
AAGAAGGGCG AGCGCATCGG CCTCTCCGTG AAGACCCTCG CGCCGAGGCC CGAGCCGGTG
GCGGCTCCGG CGGGCGAGAA GCCGGCCCGC CCTGCCCCGC CTCCGCCGCC GAAGGCGGGG
GACGTGGTCG ACGTGTCGGT GGACAAGGTC GAGAGCTTCG GCGTGTTCGT CCGCTTCGCC
GGCGGCCGCG GCCTCGTCCC GGCGAGCGAG ACCGGCACGC CGCGCGGCTC CGACCTGCGC
AAGTCGTTCA AGGTCGGCGA CGGCTTCCGC GCGCTCGTGC TCGCCATCGA CGAGCAGCAC
CGCATCCGCC TGTCGAAGAC CGGCGCCGAG GAGGCCGCGG AGCGCGCCGA GGCGGCGGAC
TACATGAAGA AGTCGCCCCG CCCCTCCGGG AAAGGCTTCG GCACGCTCGG CGATCTGCTC
CGGCAGAAGC TCGAGAAGAA GTAG
 
Protein sequence
MSEEKKPESQ GPVVIRKGHV KPPPPGAKIE APQEERLEPA PQPRDARPLW QRVAEERSGA 
GRAAPSGGPG GPGARRGPPR GERPPRGNDR GERRDRPREA RGPEGERPRP PPRAPEPPPA
PEVAAPDEGS FADMLATSES GAPRRRFQVG EKVAGKIIQL GHDVAFLELG SGVAEAMIEV
AELKDADGNV TAREGDILDA VVIHADDRGV IVSKGHARAT RDHAREAVIE AAQTGLPVEG
LVKAANKGGL EVEVHGVRAF CPMSQIDVRF VGDPSTFVGQ KLQFKVQRAD ARDCVLSRRA
LLEEERAERA RATRERLAPG AVFDGVVTSV QDYGAFVDIG GVEGLVHVSE LSWDRVSKPQ
DLLTAGDAVQ VQVLRIDEDP KKGERIGLSV KTLAPRPEPV AAPAGEKPAR PAPPPPPKAG
DVVDVSVDKV ESFGVFVRFA GGRGLVPASE TGTPRGSDLR KSFKVGDGFR ALVLAIDEQH
RIRLSKTGAE EAAERAEAAD YMKKSPRPSG KGFGTLGDLL RQKLEKK