Gene Rcas_3339 details


Gene Information

Locus tag: Rcas_3339
Symbol:
ID: 5540837
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 4356918
End bp: 4358135
Gene Length: 1218 bp
Protein Length: 405 aa
Translation table: 11
GC content: 61%
IMG OID: 640895456
Product: (Uracil-5)-methyltransferase
Protein accession: YP_001433407
Protein GI: 156743278
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG2265] SAM-dependent methyltransferases related to tRNA (uracil-5-)-methyltransferase
TIGRFAM ID:
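
The Start bp, End bp, Gene Length and Protein Length fields in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names gene_length and protein_length are hypothetical helpers introduced only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by a CDS of the given length.

    Assumes the CDS is a whole number of codons; if has_stop_codon is
    True, the last codon is treated as a stop and is not counted.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length(4356918, 4358135)
print(length_bp)                  # 1218
print(protein_length(length_bp))  # 405
```

Under these assumptions the computed values agree with the Gene Length (1218 bp) and Protein Length (405 aa) fields listed above.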


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0683491
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTATTT CAAATAAACA CTTTCGCCAG CAGATTATTG AGGCAGCGCG GGCAGGCGAG 
ACGACTACGC CGCGTTGCCC ACATGCGCCA CCGCAGGGTC AGTGCGGCGG ATGCGTCTTT
CAAGATCACG ACTACCCGGC TCAGGTGGCA GCAAAACGCG CGGCGCTTTG CAGCCTCTGG
AGCGATGACC TGCCAGACAA TTGTATCGAT ACGCTCGATG TCGTCGCTTC GCCGAACCCG
TTTGCCTATC GCACACGCAT GGATTTTGTG GCGAGCAAGG AGCGATTTGG TCTGCGGCGT
GGCGGCAGGT TCAACTACAT CATCGACCTG CATGAGTGCC ATCTCATCCC AACGCATGCC
TTCACTGCCG CGCGCGCTGT GTACGAGCAC GCAATGGCGC TGGGGTTGCC CGACTACAAT
CTGAAGACCC ATGCCGGTTT TCTGCGGTAT GTGGTCGTGC GGCGCAGCCC CGACGATGAA
CTGCTGCTGG CGCTGGTTAC CGCCGCGCCC GAAGAAGAAA AGGTCTCTGC CGAAAAAGTT
GAGCGTGTGG CGCTGGCAGC CCTTGAACAT CCGGGTGTGC TGGGCGTCCA TTGGCTGATC
AACGCCACCC GCACCGACGT ATCGTTTGGC GAGCCGGTGC GTCACTGGGG GCGCGCAACG
TTGCCAATGC GTGTTGGGGC GCACACGCTC GAAATCGGTC CCAATACCTT CTTTCAGAAC
AATGTCTGGC TGCTGATGCC GCTGCTCGAG GCGGTGCGCG ACGCAGTCGC CGCATGCGGG
CATGCAGGCG CAATCGCCGA TCTATACAGT GGGGTCGGCG CCATTGCGCT TCATATTGCC
AGGCATGCGG ATCGAATTGT CTGTATCGAG TCATCTGGCG AGAGTGTGCG CCTGGCGCGC
GAGAACAGCG TGCGCGCCGG GTTTGAGCAT ATCGCCGTGA TCGAAGCGGA TGTCGCCGAT
GCGCTTCGCG CACAGACGAC CGGCGCATTC GATGTGGTTG TCGCCGATCC GCCGCGCACC
GGTCTGGGTC CTGAGGTCTG TCGCGAGTTG CTGCGATTGC GCCCCCGGCG GATCGTGTAT
GTCTCGTGCA ATCCGCTAAC GCAGCGTGAC GACATCCGCG CGCTGCAATC AGGGTATCGT
CTGGTGTTGC TCCAGGGGTA CGACATGTTT CCACAGACGC CGCATCTGGA GGCGCTGGCG
GTGCTTGATG TTATATGA
 
Protein sequence
MAISNKHFRQ QIIEAARAGE TTTPRCPHAP PQGQCGGCVF QDHDYPAQVA AKRAALCSLW 
SDDLPDNCID TLDVVASPNP FAYRTRMDFV ASKERFGLRR GGRFNYIIDL HECHLIPTHA
FTAARAVYEH AMALGLPDYN LKTHAGFLRY VVVRRSPDDE LLLALVTAAP EEEKVSAEKV
ERVALAALEH PGVLGVHWLI NATRTDVSFG EPVRHWGRAT LPMRVGAHTL EIGPNTFFQN
NVWLLMPLLE AVRDAVAACG HAGAIADLYS GVGAIALHIA RHADRIVCIE SSGESVRLAR
ENSVRAGFEH IAVIEADVAD ALRAQTTGAF DVVVADPPRT GLGPEVCREL LRLRPRRIVY
VSCNPLTQRD DIRALQSGYR LVLLQGYDMF PQTPHLEALA VLDVI