Gene RoseRS_0401 details


Gene Information

Locus tag: RoseRS_0401
Symbol:
ID: 5207337
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus sp. RS-1
Kingdom: Bacteria
Replicon accession: NC_009523
Strand:
Start bp: 507927
End bp: 509120
Gene Length: 1194 bp
Protein Length: 397 aa
Translation table: 11
GC content: 60%
IMG OID: 640594027
Product: glycosyl transferase, group 1
Protein accession: YP_001274782
Protein GI: 148654577
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID: [TIGR03087] sugar transferase, PEP-CTERM/EpsH1 system associated
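
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the start and end coordinates are 1-based and inclusive and that the CDS ends with a single stop codon. Both assumptions are consistent with the listed values but are not stated in the record, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(507927, 509120)
print(length)                     # 1194
print(protein_length_aa(length))  # 397
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of this record.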


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGAGGTG CGGCGATGAA CATCCTCTTT GTTGCACCAC GCTTCCCCGA TCCGCTCATC 
CAGGGCGACC GGCTGCGCGC GCGTCAGTTC CTGCGCGTTC TGCGCCGCTG GCACGAGATC
ACCCTCGTCA CACCCGCGAC GCCAGAGCTG CCAGACGCTG CCACCATCGC CGACATGTGT
GACCGGTGGG TTCCAGTGCA TGAACCGCGC TGGCGGGCGC TGTGGCGCGT CGCGTCGCAT
AGTATTGAGA CGCTGCCATT GCAGACAGCG CTGTTCAGTT CGCCAGCGTT GATCCGCACC
GTGCGCGACC TGGCGCATCA CCAATCATTC GACTTACTCT ACCTGCATAC CGCACGGGTA
GCGCCGGTGG TCGACGCCGT CCCGGATCTA CCGAAAGCAA TCGACTTTAT CGACGCGCTG
TCGCTGAATA TGTATCGCCG CGCCCGCTGT CAGGGCGGTA TCACGCACTG GTTGTTCGAT
ATTGAAGCGC GCCGGATGGC AGAATGCGAG CAGTACCTTT CCGAAATATG CGATGTACAG
TTCGTCTCGG CAGTTCGTGA TCGCGCATTC TTAGGACCGA ACGTTCGAGT TGTCAACAGT
GGGGTTGACG TGGCCCAGTT TCCTTATGTA GAACAGGGAC GTCTCAACGA TCTCATTGTC
TTGACTGGCC GCATGGGATA CTTCCCCAAT GCGGACGCCG CAGTTTCGTT TGCTTCAAAC
GTCTTGCCCC TGGTCCGGCA CGAGGTGCCA ACGGCACGCC TGCAGATCGT TGGCGCCGAT
CCGCCGCAGC GCGTACGAGC ACTGGCGCGT TTGCCGGGTG TTGAGGTGAC TGGCCACGTG
CCTCGTATCC AGGACTATCT GCAACGCGCG ACAATCGCAG TGGCGCCGTT ACGCAGTGGC
TCAGGCTTTC AGACTAAAGT AGCTGAAGCA ATGGCAAGCG GTACTCCAGT TGTGGCGACG
CCGCACATAC TGGATTCGCT TGATGTTCGT CATGACGAGC ATGTGCTTCT GGCGCACGAT
GATGCTGAAA TGGCAGCGCA AATCGTGCGC CTGCTGCGCG ATGCTGCGCT GCGCCGCCGG
TTGGCGCGGG CAGCACGCGC GCTGGTTGAA CAGCGTTATA CCTGGGAACG CTCGGCCGCA
GCGATCAATG CACATCTCGT TGCGGTCGTT CAGCAAGGCA AAAAGATATC ATGA
 
Protein sequence
MRGAAMNILF VAPRFPDPLI QGDRLRARQF LRVLRRWHEI TLVTPATPEL PDAATIADMC 
DRWVPVHEPR WRALWRVASH SIETLPLQTA LFSSPALIRT VRDLAHHQSF DLLYLHTARV
APVVDAVPDL PKAIDFIDAL SLNMYRRARC QGGITHWLFD IEARRMAECE QYLSEICDVQ
FVSAVRDRAF LGPNVRVVNS GVDVAQFPYV EQGRLNDLIV LTGRMGYFPN ADAAVSFASN
VLPLVRHEVP TARLQIVGAD PPQRVRALAR LPGVEVTGHV PRIQDYLQRA TIAVAPLRSG
SGFQTKVAEA MASGTPVVAT PHILDSLDVR HDEHVLLAHD DAEMAAQIVR LLRDAALRRR
LARAARALVE QRYTWERSAA AINAHLVAVV QQGKKIS
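
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any length or composition check. The following is a minimal sketch of that clean-up together with a GC percentage calculation. The helper names are hypothetical, and the short input string is an illustration rather than the gene itself.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a sequence printed in blocks of 10."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# Hypothetical short input, not the full gene sequence.
example = clean_sequence("ATGCGAGGTG cggc\nGATG")
print(example)       # ATGCGAGGTGCGGCGATG
print(len(example))  # 18
```

Applied to the full gene and protein sequences, the cleaned lengths would be expected to match the Gene Length (1194 bp) and Protein Length (397 aa) fields, and gc_percent gives a value to compare against the rounded GC content field.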