Gene RoseRS_2649 details


Gene Information

Locus tag: RoseRS_2649
Symbol:
ID: 5209618
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus sp. RS-1
Kingdom: Bacteria
Replicon accession: NC_009523
Strand:
Start bp: 3285292
End bp: 3286665
Gene Length: 1374 bp
Protein Length: 457 aa
Translation table: 11
GC content: 63%
IMG OID: 640596251
Product: hypothetical protein
Protein accession: YP_001276973
Protein GI: 148656768
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1207] N-acetylglucosamine-1-phosphate uridyltransferase (contains nucleotidyltransferase and I-patch acetyltransferase domains)
TIGRFAM ID:
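
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single untranslated stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 3285292
end_bp = 3286665
gene_length_bp = 1374
protein_length_aa = 457

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
span = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not
# counted in the protein length.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_length = codons - 1

print(span, remainder, expected_protein_length)
```

Under these assumptions the span equals the stated gene length, the length is a whole number of codons, and the codon count minus the stop codon equals the stated protein length.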


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTGATG AACCGATTGT TTTCTTCGAA GACGAAGGAT GCCGCACCTT TCTGCCATTG 
ACGCATACAC GACCTGTTTG CGATCTGCGC TGCGGCATCT TTACTCTGCG CGAGCGGGTG
CGCACGCTGA CGGGCATGAC GCCAGCGGTC ATCTGCCGTT CTCACCTGGC GCGGGCATAT
GGCGTCGGGC GCTGGCCCCT CACGCTGCTC AGTCGGAGTA CACCGCTCAC CTTTGTGAAT
GCGCGTGCGC TCGATGCGGG ATGGATCTTC GATCTACTTG ATGAACCGGT AGGAACGGTG
TACCTGACCG ACGCGGGACA TGCTCTCCTC GGCGGACCGG TTCTGTTGGG TGCGCGTCTG
ACGCCGTGGA TGGCAAGTGC AGTGCTCCCC TACCTTCTTG AGCAGCGCGG CGCGGCGGCG
CTGGCGGAAC TGCGTCGTAT CGGGCGTCTC GTCGAGATCG AGACACGCCT GCTCACCTTC
CCCTGGGACC TGATCGCGCT GAACGGCGAA CAGATCGTGC GTGATGTGCC GCTCGTCGTC
AGGCAGGACG GCTGGATCTG CGCGGCTGAC CAACCGCCTG CCCATCCATC GATTGTCGTC
AGCAACCCGG CGCACGTCTT CATCCACCGC GATGCGCGCC TGGAACCGCC GCTGGCGCTC
GATGCGCGCG ATGGTCCAAT CGTGATCGAT GCTGCACGGA TTGAACCGTT TTCGTTCATC
CAGGGACCGG CCTGGATCGG TCCTGGCTCC CTGATCGCCA GTGCGCGCAT ACGCGGCGAA
ACAAGCATCG GACCTGTCTG CCGTATCGGC GGCGAGGTTG AGGCGAGCAT CGTTCAGGGG
TACAGCAACA AGCACCACGA CGGCTTTCTC GGGCATTCGT ACCTTGGTGA GTGGGTCAAC
ATCGGCGCCA TGACCACCAA CAGCGATCTG AAGAACACCT ACGGCACAAT TCGCATGGTG
ATCGAGGGGT TTGGTCAGAT CGACAGCGGC ATCCTGAAAC TGGGGTGTTT CCTCGCCGAT
CACGTCAAAC TGGGGATCGG GGTCCACCTG AACGGTGGCG CCGTCATCGG CACCGGTTCG
AACATTTTTG GGGTTCACTT TGCGCCCAAG ACCATTCCCC CCTTCACCTG GGGCGGTGAG
GTGTTCCGCG AGTACCGCAT CCAGTCGATG ATCGACGTGG CGCGCAAAGT TATGGCGCGT
CGCAAGGTAA GCATGAGCGC TGAGCAGGAA GAAGTGCTCC GCGCTGTGTT TGCCATGACG
CGCGGCGACC GCGCCGGGTT GGACGACGGC GGCGGACGCG ATGAAGCAGC GCTGCGGCGA
GCGGAGGCGG AAGCGGTGCG CGCCTTCGAC CTGGTCGAAG CGTCGGGGGG GTGA
 
Protein sequence
MTDEPIVFFE DEGCRTFLPL THTRPVCDLR CGIFTLRERV RTLTGMTPAV ICRSHLARAY 
GVGRWPLTLL SRSTPLTFVN ARALDAGWIF DLLDEPVGTV YLTDAGHALL GGPVLLGARL
TPWMASAVLP YLLEQRGAAA LAELRRIGRL VEIETRLLTF PWDLIALNGE QIVRDVPLVV
RQDGWICAAD QPPAHPSIVV SNPAHVFIHR DARLEPPLAL DARDGPIVID AARIEPFSFI
QGPAWIGPGS LIASARIRGE TSIGPVCRIG GEVEASIVQG YSNKHHDGFL GHSYLGEWVN
IGAMTTNSDL KNTYGTIRMV IEGFGQIDSG ILKLGCFLAD HVKLGIGVHL NGGAVIGTGS
NIFGVHFAPK TIPPFTWGGE VFREYRIQSM IDVARKVMAR RKVSMSAEQE EVLRAVFAMT
RGDRAGLDDG GGRDEAALRR AEAEAVRAFD LVEASGG
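
Both sequences above are printed in space-separated blocks of ten characters, so they need to be joined before analysis. The following is a minimal sketch of that step, along with a GC-fraction helper and a codon-by-codon translation. The helper names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical. The amino acid assignments are those of the standard genetic code, which translation table 11 shares for internal codons. Only the first line of the gene sequence is used as sample input.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code;
# translation table 11 uses the same assignments).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons in frame 1; '*' marks a stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        a, b, c = (BASES.index(base) for base in seq[i:i + 3])
        protein.append(AMINO_ACIDS[16 * a + 4 * b + c])
    return "".join(protein)


# First line of the gene sequence, as displayed in the record.
first_line = ("ATGACTGATG AACCGATTGT TTTCTTCGAA "
              "GACGAAGGAT GCCGCACCTT TCTGCCATTG")
dna = clean_sequence(first_line)
print(translate(dna))  # MTDEPIVFFEDEGCRTFLPL
```

The printed peptide corresponds to the first 20 residues of the protein sequence. Applying `gc_fraction` to the full cleaned gene sequence would give a value to compare with the GC content field.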