Gene RoseRS_4074 details


Gene Information

Locus tag: RoseRS_4074
Symbol:
ID: 5211057
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus sp. RS-1
Kingdom: Bacteria
Replicon accession: NC_009523
Strand:
Start bp: 5106693
End bp: 5109014
Gene Length: 2322 bp
Protein Length: 773 aa
Translation table: 11
GC content: 65%
IMG OID: 640597662
Product: glycosyl transferase, group 1
Protein accession: YP_001278368
Protein GI: 148658163
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
        [COG0463] Glycosyltransferases involved in cell wall biogenesis
TIGRFAM ID:
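
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single untranslated stop codon. Neither assumption is stated in the record itself, and the variable names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 5106693
end_bp = 5109014
gene_length_bp = 2322
protein_length_aa = 773

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codons = gene_length_bp // 3
expected_protein_length = codons - 1

coords_consistent = span == gene_length_bp
protein_consistent = (
    gene_length_bp % 3 == 0 and expected_protein_length == protein_length_aa
)
print(coords_consistent, protein_consistent)
```

Under those assumptions both checks hold for this record: the coordinate span equals the listed gene length, and the codon count minus one equals the listed protein length.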


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTGAAA CGGAAGATGG CACGCAGGGG CTTGCCAAAA AACCTACCCC TGTGAGGTTC 
CCCCCCCACC CCCTCGATCC GGCACGCCCG GTCTATGGCT ACGCACCTGC CGATCCATCG
GCGCAGCCGG TGGTCAGCAT TGTGACCCCC TGCTACAACG CAGGGGCAAT GTTCCTCGAT
ACCATCGCCT CGGTGCTGCG CCAGTCGTTG CAGCAGTGGG AGTGGATCAT CGTCAACGAT
GGGTCGGATG ATGCGGCGAC GCTGCGGGTG CTGGGTGCGC TGCGCGCCTC GCCTGACCCG
CGGATTCGGG TGGTCGATCA GCCGAACTAC GGTCCGGCGG CAGCGCGCAA TGCTGGCGTC
GCAGTCAGTC GCGCGCCGCT GCTCTTCTTC CTCGACAGCG ACGATCTGCT GGCGCCGACG
GCGCTGGAGC AACTCGCCTG GGCGCTGACG GCGCACCCGG AGGTTGCCGC AGTCACAACC
TGGTACGTGC TCTTTGGCGC GATGCAGGGC CTGTTACGGC GTGGGTTTGC GTCGCGCCAT
ACGTTTCCCC ACGACAATCC GCTGACGGTC AGTGTGATGC TGCGCCGCAC GGCGTTCGAG
CGCATCGGCG GGTTCGATGA GAGTCTGCGC CATGGGCTGG AAGATTATGA GTTCTGGGTG
CGCCTGGCGG ACGCCGGGAT GTGGGGATGC AACATCCACG AGGCGCTGGT CTGGATCCGT
CGCAAGTCGG CAGACATGTA TCGCGGGTAT CGCTGGAACT TTCAGACTGA TCGCGCGGCG
CTGGCACGTA TGCGCCGTGC CCTGCGCGCG CGCTATCCGC GTGTGTTCCG CGACGGTCCG
CCGCGTCCGC CGGGTGAGCC GTCGCCGATC CTGCAACCGC ATCCGTTGAT CATGCCCGAC
CCGCCGTTCG AGAATCGTCT GGCGCCACGG GGTGAGCGTC GGGTGTTGCT GCTGCTGCCC
TGGGTCGAGG TAGGGGGAGC GGATCGCTTC AGCATCGATC TGGCGGATGG GTTGCGCGCG
CGGAATTGCC GCGTGACGGC GTGCCTGTTG CGCCCGTCGG CGAATCCGTG GAAGCATGAA
CTGATCATGG CGGCGCACGA GGTCTTCGAT CTGCCGCTGT TTCTGGCGTT TGCCGATTAT
CCGCGCTTTC TGCGGTACCT GGTCGAGTCG CGCGGCATCA CGACGGTGGT GGTGCATAAC
GATCTGTTCG CCTATCGTTT GTTGCCATTC CTGCGCGCCT GGTGTCCACA GGTGACGGTG
ATCGATTTTC TGCATATTGT GCAGGATCAT TACCACGGCG GCGTCCCGCG CGCGGCGCTG
GAGTACCGTT CACTGATCGA TCTGCACGTT GCGGCGTCGC ATCAGGTGCG CGAGTGGATG
GTTGCACACG GCGCCGATCC AGAACGGGTC GATGTCTGTT CGATCAATGT GGATACGCAG
CGCTGGAAGC CCGATCCTGC GCTGCGCGAC CGGGTGCGCG CGGAGTTGGG GTTGCGCGTC
GATGAACCGG TCGTGTTGTT TGTCGGGCGC CTGGTTCCGC AGAAACGCCC GCGCCTGGTT
GTGGAGATTG CGCGCGCGCT GGTGGAGCGT GGCGTCCACT GCACCTTCCT GGTGATCGGG
GATGGTCCCG ACAGGGGATG GATGCAGCGC TTCGTGCGCC GACACCGGCT TGAAGGTCGT
GTGCGCCTGT TGGGTTCGGT GTCGTCGACG CGGGTGCGTG AGATGATGGC TGCTGGCGAC
CTGCTGGTGC TGCCGTCGGA GAGTGAAGGA ATCGCATTTG TGCTATTTGA AGCGATGGCG
ATGGCGCTTG TGCCGGTCGC TGCCGATGTT GGCGGGCAGC GCGAATTGGT GACGCCGGAG
TGCGGCGTGC TGATCCCGCC GGGAGGCGAT CAGGTTGCAC AGTATGTCAC TGCGCTGGAG
CGTCTGATCG CCGATCCGTC GCAACGTGCG GCGATGGCGA TGGCGGCGCG CACACGGGTG
GTGGAACGGT TCGACCGGCG GCAGATGATT GATTGCATGC TGGCGCGGAT CGAGCGGGCG
GAGACGCTGG CGCGCACCGC GCCGCGCCCG CTGGTTGATC GCGATCTGGG GCTGGCGAGC
GCGTCGCTGG CGATCGAGTA TCTCCAGTTC CGTGAGGCAT TGCTGCGGCT TGCGCCGGTG
CGCCGGGCGC GTGCAGTGCG CTGGTCGTCC GCCTGGCTCC GGCTGGCGCG TCTAATGCGG
ATGCGGGCGT GGGTTGACCG TTTTGATCGG CGGATCTATG TGCTGCGCCG GGAGGTTATG
TGGCGGATCA AGCGCGCGCT GGGGAGAGCG TATAATCGGT GA
 
Protein sequence
MAETEDGTQG LAKKPTPVRF PPHPLDPARP VYGYAPADPS AQPVVSIVTP CYNAGAMFLD 
TIASVLRQSL QQWEWIIVND GSDDAATLRV LGALRASPDP RIRVVDQPNY GPAAARNAGV
AVSRAPLLFF LDSDDLLAPT ALEQLAWALT AHPEVAAVTT WYVLFGAMQG LLRRGFASRH
TFPHDNPLTV SVMLRRTAFE RIGGFDESLR HGLEDYEFWV RLADAGMWGC NIHEALVWIR
RKSADMYRGY RWNFQTDRAA LARMRRALRA RYPRVFRDGP PRPPGEPSPI LQPHPLIMPD
PPFENRLAPR GERRVLLLLP WVEVGGADRF SIDLADGLRA RNCRVTACLL RPSANPWKHE
LIMAAHEVFD LPLFLAFADY PRFLRYLVES RGITTVVVHN DLFAYRLLPF LRAWCPQVTV
IDFLHIVQDH YHGGVPRAAL EYRSLIDLHV AASHQVREWM VAHGADPERV DVCSINVDTQ
RWKPDPALRD RVRAELGLRV DEPVVLFVGR LVPQKRPRLV VEIARALVER GVHCTFLVIG
DGPDRGWMQR FVRRHRLEGR VRLLGSVSST RVREMMAAGD LLVLPSESEG IAFVLFEAMA
MALVPVAADV GGQRELVTPE CGVLIPPGGD QVAQYVTALE RLIADPSQRA AMAMAARTRV
VERFDRRQMI DCMLARIERA ETLARTAPRP LVDRDLGLAS ASLAIEYLQF REALLRLAPV
RRARAVRWSS AWLRLARLMR MRAWVDRFDR RIYVLRREVM WRIKRALGRA YNR
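
Both sequences are displayed in groups of 10 characters, 60 per line, so the spacing has to be removed before they are used elsewhere. The following is a minimal sketch of one way to do that and to compute a GC fraction. The function names `clean_sequence` and `gc_fraction` are hypothetical helpers and not part of any tool associated with this record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a grouped sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; returns 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above, as displayed in 10-base groups.
first_line = "ATGGCTGAAA CGGAAGATGG CACGCAGGGG CTTGCCAAAA AACCTACCCC TGTGAGGTTC"

seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Passing the full gene sequence through `clean_sequence` should give a string whose length can be compared with the Gene Length field, and `gc_fraction` of that string can be compared with the GC content field.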