Gene RoseRS_3583 details

Gene Information

Locus tag: RoseRS_3583
Symbol:
ID: 5210561
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus sp. RS-1
Kingdom: Bacteria
Replicon accession: NC_009523
Strand:
Start bp: 4480442
End bp: 4481668
Gene Length: 1227 bp
Protein Length: 408 aa
Translation table: 11
GC content: 61%
IMG OID: 640597177
Product: glycosyl transferase, group 1
Protein accession: YP_001277889
Protein GI: 148657684
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
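
The length fields above follow from the coordinates. As a minimal sketch, assuming Start bp and End bp are 1-based inclusive positions (an assumption about this record's conventions, though it agrees with the listed 1227 bp), the arithmetic can be checked as follows. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by an unspliced CDS: total codons minus the stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length(4480442, 4481668)
print(length, protein_length(length))  # 1227 408
```

Both values match the Gene Length and Protein Length fields of the record.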


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGCATGTCG GTATCGATTT TACGGCTGGC GTCTGGCAGG GGGCCGGCAT TGGGCGGTAC 
ACACGCGAAC TGATCGGCGC CGTTCTTGCT CAGAGTCCTG ATCTTCGCTT CACGCTGTTC
TACGCCGCTG GCTTTCCAGG CGCTGATTCT CCGCCCTATC TGCCTGAGGT GCATCGCCTC
TGCGCCTCAC ATCCGCATAC CCGCGCCGTC CCGATCCCGC TGCCGCCGCG TCGCCTGACG
CAGATCTGGC ATCGGTTGCG CATTCCGCTG CCGATCGAAT GGCTGACCGG TCCGCTCGAT
ATTCTGCACG CGCCTGATTT CGTGCCGCCG CCAACCCGCG CTCGCACCCT CGTCACCATC
CACGATCTCT CGTACATGGT GCATCCCGAG TGCGCAGTTC CGGGAGTCGC CGCTTATCTG
CGCGATGCCG TGCCGCGCGC CTTGAAGCAA GCCAGTATCA TTATTGCCGA TTCGGAGTCG
ACCAGGCGCG ATCTGCATCG ACTGTTGAAC ATCGCCCTCG ACCGTGTGAC GGTGGTCTAT
CCAGGGGTCG ATGCGCGTTT CCGCCCGTTG CCGCCGGACG TATGCGAACC GGTGCGGTGT
CGGTTGAACC TGCCACGCCG TTTCATTCTG TTTGTAGGCA CCATCGAACC GCGGAAGAAC
CTCGTGCGGT TGCTGGAAGC GTTTGCCCGC ATCGACCCGA CGACGGGCGG GGAGGACCTC
TTCCTGGTAC TCGCCGGTCG CCGTGGATGG ATGTATCAAC CGGTGTTCGC GGCCATTGAC
CGGTTGAATT TACATGATCG TGTCCAACTG CTCGATTTTG TGGCGGATTC TGACCTGCCG
GTAGTGTATA ATCTTGCACA GGTATTCGTG TATCCCTCAC TGTACGAGGG GTTCGGCTTA
CCACCGCTCG AAGCGCTGGC GTGCGGTACG CCGGTGGTGA CATCTGACAA TTCGAGTCTC
CCGGAGGTGG TGGGCAATGC CGCTCTCCTG GCGCGCGCCG ATGATGTGGA GGCGCTTTCG
GAGGGGATGA TCCGCCTGTT GAAAGACGTG GCGCTGCGGG ATCGGTTGCG TCAGGCGGGT
CTGGAACAGG TGCGACGGTT TCGTTGGGAA GCGTCTGCCC GACAGATTAT CGAACACTAT
CATACGTTGT CAACGGGAGC ATCGCATGAG GCAACAACCG GAGCTCTCCG GCGAAGCGCT
CGACTCCGAC GAGATGGAGA GCCGTAG
 
Protein sequence
MHVGIDFTAG VWQGAGIGRY TRELIGAVLA QSPDLRFTLF YAAGFPGADS PPYLPEVHRL 
CASHPHTRAV PIPLPPRRLT QIWHRLRIPL PIEWLTGPLD ILHAPDFVPP PTRARTLVTI
HDLSYMVHPE CAVPGVAAYL RDAVPRALKQ ASIIIADSES TRRDLHRLLN IALDRVTVVY
PGVDARFRPL PPDVCEPVRC RLNLPRRFIL FVGTIEPRKN LVRLLEAFAR IDPTTGGEDL
FLVLAGRRGW MYQPVFAAID RLNLHDRVQL LDFVADSDLP VVYNLAQVFV YPSLYEGFGL
PPLEALACGT PVVTSDNSSL PEVVGNAALL ARADDVEALS EGMIRLLKDV ALRDRLRQAG
LEQVRRFRWE ASARQIIEHY HTLSTGASHE ATTGALRRSA RLRRDGEP
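
The protein sequence can be checked against the gene sequence by translating it codon by codon. The following is a rough sketch, not the pipeline that produced this record. It assumes the gene sequence is listed 5' to 3' on the coding strand, and it uses the sense-codon assignments of NCBI translation table 11, which are the same as those of the standard code (the two tables differ only in which codons may act as start codons). The names CODON_TABLE and translate are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in NCBI order (first base varies slowest);
# "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate("ATGCATGTCG GTATCGATTT TACGGCTGGC"))  # MHVGIDFTAG
```

Passing the full gene sequence to translate should give a string that can be compared directly with the protein sequence once its spaces are removed.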