Gene RoseRS_3368 details

Gene Information

Locus tag: RoseRS_3368
Symbol:
ID: 5210345
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus sp. RS-1
Kingdom: Bacteria
Replicon accession: NC_009523
Strand:
Start bp: 4222696
End bp: 4224735
Gene Length: 2040 bp
Protein Length: 679 aa
Translation table: 11
GC content: 59%
IMG OID: 640596965
Product: glycosyl transferase family protein
Protein accession: YP_001277678
Protein GI: 148657473
COG category: [M] Cell wall/membrane/envelope biogenesis; [R] General function prediction only
COG ID: [COG0438] Glycosyltransferase; [COG1216] Predicted glycosyltransferases
TIGRFAM ID:
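
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive, and that the CDS ends in a single untranslated stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 4222696
end_bp = 4224735
protein_length_aa = 679

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS ends in one stop codon that is not translated.
codon_count = gene_length_bp // 3
expected_protein_length = codon_count - 1

print(gene_length_bp)           # 2040
print(gene_length_bp % 3)       # 0, the length is a whole number of codons
print(expected_protein_length)  # 679
```

Under these assumptions the computed values agree with the listed gene length (2040 bp) and protein length (679 aa).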


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00886374
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAAAACGC TCAACTCTGT CGCTCCCACC ATCGTGTTGC CCGCGTGGCA GGCGCTTCGC 
GCGGAACAGG CTGTCACGTC ACACTCGCCG CACGCGCTCA TGGCAGAAGT TCAACCTTCA
ACCTTCAACC CCCAACCTTC AACCTTCAAC CTTCAACCTT CAATCTTCAA CCTTCAGCCT
TCAACCTTCA ACCTTCAACC TTCAACTCCC GACCCCCAAC GCCCACCCAT CACGATCATC
ATCCTCACAT GGAATGGTCT CGAATATACC CGCCGTTGCA TTGAGAGCAT CCGCGCCCAC
ACCCACGATA TAGCGTATCA TCTGCTGGTG GTGGACAACG GAAGTAGCGA TGGAACGCTG
GAGTGGTTGC GCTCACAACC AGAGATCAGG GTGATTGCCA ATGAGCGCAA CCTGGGATTT
GCGCGGGGCA ACAATCAGGG CATGGCAGCC ACTCCACCGG ACCACGACGT GCTGCTGCTC
AATAACGATA CCCTGATCAT TCAGGATCAC TGGCTTGCGC ATTTGAGCGA TGTGGCGCAT
AGCCATCCTG CATATGGCAT TGTCGGGTGC ACGCTGCTGC ATGCCAATGG ATCGCTCCAG
CACGCCGGAA CCTATATGCC GACGGATAGT TTTTGGGGGT GCCAGATCGG TGGCGGCGAA
GCATACATCG GGCAATATCC CGGCGTGCGC GAGGTCGAGG GGATCACAGG CGCGTGCATG
TACATCCGCC ACGACGTGCG CGCGCTGTTG GGCGGTTTCG ATGAAGCGTA CACCTCGTAC
TTCGAGGATA CCGACTACTG TCTGCGGGCG CGTCAGGCGG GGTTCAGGAT CGTCTGCACC
GGCGGCGTTC AGGTGGTGCA TTTCGAGAAT ACCAGCGCCA GGATCAATAA TGCATCCTGG
CAGGCGATGT GGGACAGGGG GCGTGAGGTG TTCGTTCGCA AATGGAAATC TTTCTACGAC
CAGAAGTACC ACCGTGCCAT CGTCTGGCAC TCACTGGTGG CGTCGCCGTC CGGGTATGCG
ACGTCGTCGC GTGAACTTGT GCTCGAACTC GACCGTTGCG GCATCGATGT GCGCCTGGCG
TGCATCTGGG GGAATGATTT CACCGAACCG CTGACGGGCG ATCCGCGCAT CGATCAGTTG
CGCGCCCGTC TCAAAGACTC CAGTCTGCCG CAGGTCGTCT ACCATCAGGG CGACTCGTTC
ATCAAGAATA GCGGACGCTA CCGCATCGGT TATACCATGA TCGAGACCGA CCGGTTGCCG
GATGAGTGGG TCTATCAGGC GAACCAGATG GACGAGGTCT GGACGCCAAC CCACTGGGGG
GCTGAGGTCT TCCGCGCCAG CGGCGTGAAA CGTCCGATCT TCGTTATTCC GCTGGGGATC
AATCCAAACT ATTTTCATCC TGGCATCCAG GGGCGTAAGC CCGGCAATCG ATTCGTCTTC
CTGTCAATCT TCGAGTGGAT CGAGCGTAAA GCGCCGGAAG TGTTGATCCG CGCCTACCAG
CAAACGTTTC GCCGCAGTGA TGATGTGCTG CTGCTGCTCA AGATTTTCAA TTACGATCCC
GGTCTCGATG TGGCGCGACG CCTGGGAGAA CTGATCCAGC GCGACGGTCC GCCGGTGGTG
GTGCTGCCCA ATCAGCAGAT CGCTGCATAC CAGCTCGGCT GTCTCTACCG GAGCGCCGAC
TGCTTCGTGC TGCCAACGCG CGGCGAGGGA TGGGGCATGC CGGCGCTGGA AGCAATGGCG
TGTGGACTGC CGGTTATTTC CACCAATTGG GGTGGGCAGA CGGCGTTTCT CAACGCAGAT
GTCGCCTATC CGCTGCAGGT GCGCGGTCTC GTCCCGGCGG AAGCGCGCGC CCCATACTAC
CGCGGGTTAC GCTGGGCAGA CCCCGACATT GACCACCTTT GTGCACTGAT GCGCCACGTG
TATGAACATC CTGACGAAGC GCGCGCGGTA GGAGCGCGGG CAGCCGTCGA AGTCGCTGCA
CGCTGGACAT GGGCGCATGC GGCTGCGGCG ATCATCGAGC GTCTGGAAGC GATTGAATAA
 
Protein sequence
MKTLNSVAPT IVLPAWQALR AEQAVTSHSP HALMAEVQPS TFNPQPSTFN LQPSIFNLQP 
STFNLQPSTP DPQRPPITII ILTWNGLEYT RRCIESIRAH THDIAYHLLV VDNGSSDGTL
EWLRSQPEIR VIANERNLGF ARGNNQGMAA TPPDHDVLLL NNDTLIIQDH WLAHLSDVAH
SHPAYGIVGC TLLHANGSLQ HAGTYMPTDS FWGCQIGGGE AYIGQYPGVR EVEGITGACM
YIRHDVRALL GGFDEAYTSY FEDTDYCLRA RQAGFRIVCT GGVQVVHFEN TSARINNASW
QAMWDRGREV FVRKWKSFYD QKYHRAIVWH SLVASPSGYA TSSRELVLEL DRCGIDVRLA
CIWGNDFTEP LTGDPRIDQL RARLKDSSLP QVVYHQGDSF IKNSGRYRIG YTMIETDRLP
DEWVYQANQM DEVWTPTHWG AEVFRASGVK RPIFVIPLGI NPNYFHPGIQ GRKPGNRFVF
LSIFEWIERK APEVLIRAYQ QTFRRSDDVL LLLKIFNYDP GLDVARRLGE LIQRDGPPVV
VLPNQQIAAY QLGCLYRSAD CFVLPTRGEG WGMPALEAMA CGLPVISTNW GGQTAFLNAD
VAYPLQVRGL VPAEARAPYY RGLRWADPDI DHLCALMRHV YEHPDEARAV GARAAVEVAA
RWTWAHAAAA IIERLEAIE
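
The relationship between the gene sequence and the protein sequence can be spot-checked by translating a few codons. The following is a minimal sketch, not the tool used to produce this record: it builds a codon table whose amino-acid assignments match translation table 11 (which shares them with the standard code and differs mainly in the permitted start codons), then translates the first and last 60-base lines of the gene sequence. The names `CODON_TABLE` and `translate` are hypothetical helpers introduced for illustration.

```python
# Codon table in the conventional T, C, A, G ordering.
# Assumption: table 11 uses the same amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate DNA in frame 1, ignoring whitespace, stopping at a stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last lines of the gene sequence above (60 bases each, so both are in frame).
first_line = "ATGAAAACGC TCAACTCTGT CGCTCCCACC ATCGTGTTGC CCGCGTGGCA GGCGCTTCGC"
last_line = "CGCTGGACAT GGGCGCATGC GGCTGCGGCG ATCATCGAGC GTCTGGAAGC GATTGAATAA"

print(translate(first_line))  # MKTLNSVAPTIVLPAWQALR
print(translate(last_line))   # RWTWAHAAAAIIERLEAIE
```

Both results match the start and end of the protein sequence listed above, and the final codon TAA is a stop codon.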