Gene RoseRS_4533 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_4533 
Symbol 
ID5211518 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp5682194 
End bp5683267 
Gene Length1074 bp 
Protein Length357 aa 
Translation table11 
GC content62% 
IMG OID640598111 
ProductDRTGG domain-containing protein 
Protein accessionYP_001278814 
Protein GI148658609 
COG category[R] General function prediction only 
COG ID[COG0857] BioD-like N-terminal domain of phosphotransacetylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTACCC TCTACGTTGC ATCGACCGAA ACATTTGTCG GCAAAAGCGC CACGTGCGTG 
GGGTTGCTCA CGCGCGCGCA GCGCGATGGA TTCAGGATCG GCTATATGAA ACCGGTCAGC
GTTTCGGTGA TCCGCACCGA AACCGGAGTG CACGACGATG ATGCAGCGTT CATTCGCGAT
CACTTCGCGC TGCCTGATCC GCTGGAACGG GTAGCGCCGG TGCTCGTGAC GCAGGGGGTG
GTCGAACAGA TTATGCGCGG GCAGACAACT ATCGACTTTG CGCGGCGGTT GCGTGACGCC
TATCTGGCTA TCTCGCGCGA TCGCGACCTG GTGGTGGTGG AAGGCGCCAA TACCTGGGCA
GAAGGGTCGG TTGTCGATCT CTCCGCCGAT CAGGTCTCCG ATATGCTGGA AGCGCCGGTT
CTGCTCGTGA CGCTCTATCG GTCACCGCTC TCGCTCGACG CGATTCTGGC AGTGCAACGC
TACCTGGGCG ACCGGTTGCT GGGGGTGTTG ATCAACGAAG TCGAAGCGCC GAAGATCGAC
TTCGTGAAGA ATCGGGTGGC GCCGTTCCTG GAGCAGCGCG GCGTGCCGGT TTTCGCCGTC
CTTCAGCACG ACCCGCAACT CGCCAGCGTG ACCGTCGCCG ATCTGTTCGA GTTCCTGGGC
GGGCAGATGA TCGGGCGACC GGAGTGGTGC GAGCGACAGG TGGAACACCT GGTGATCGGC
GCGATGGGCA GCGCCGCAGC CCTCTCGCAC TTCCGTCGTC GCGCCAATAA AGCGGTTATC
ACCGGCGGCG ACCGCGCCGA CCTGCAACTT GCAGCACTGG AGACCTCAAC TTCGGTGCTG
GTGCTGACGG GCAATATCCG CCCGCCCGCG ACTGTGCTGG ATAAGGCGGA AGAGCAGAAG
GTGCCGGTCA TCATCGTCGC CGACGACACC CTGACAACCG TTGAACGCAG TGAACGCGTC
TTTGGGCACA TTCGTTTCAA ACAGGCAGCC AAAATCGCCC GCTTTAGCCA GATGCTCGAC
GAATCGTTCG ACTTTGACCG CCTGTATGAT CAATTGGGGT TGGTGCGGGT GTGA
 
Protein sequence
MATLYVASTE TFVGKSATCV GLLTRAQRDG FRIGYMKPVS VSVIRTETGV HDDDAAFIRD 
HFALPDPLER VAPVLVTQGV VEQIMRGQTT IDFARRLRDA YLAISRDRDL VVVEGANTWA
EGSVVDLSAD QVSDMLEAPV LLVTLYRSPL SLDAILAVQR YLGDRLLGVL INEVEAPKID
FVKNRVAPFL EQRGVPVFAV LQHDPQLASV TVADLFEFLG GQMIGRPEWC ERQVEHLVIG
AMGSAAALSH FRRRANKAVI TGGDRADLQL AALETSTSVL VLTGNIRPPA TVLDKAEEQK
VPVIIVADDT LTTVERSERV FGHIRFKQAA KIARFSQMLD ESFDFDRLYD QLGLVRV