Gene RoseRS_1104 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_1104 
Symbol 
ID5208051 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp1379801 
End bp1380991 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content60% 
IMG OID640594717 
Productglycosyl transferase, group 1 
Protein accessionYP_001275461 
Protein GI148655256 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0552505 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00694846 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAGATTA CTCTGATCGG TCCGACCTAC CCGTTCCGTG GCGGCATTGC GCACTATACG 
ACCCTGCTGA CGCATCATCT GCGCCAGCGC CACGATGTGC GGTTGATTTC CTACATCAAG
CAGTATCCGA AGTGGCTCTA TCCCGGCAAC ACAGCAATGG ATCCCAGCCC CGACGAAAGC
GCGCTGCGTG TGGAGTGCGA CCGGGTGCTG ACGCCAATGA ATCCGCTGAC ATGGTGGCGC
GCCTTTCGTA TGATCCAGCG CGATAACCCG GATCTGCTGC TGCTCCAGTG GTGGACGCCG
TTCTGGTCGC CGATGCTGTT TGTGCTGACC CGTCTGGTGC GACGTTATAC GAATGTGCGC
ATTCTGTTCT TGTGTCATCA CGTCATCGCT CCCGACGGCG GCATGTTCGA CTGGTATCTG
GCACGGCGCA TTCTGTGGCG CGGTCACGCA TTCATTGTGA TGAGCGAGGA GGATTTTGCC
CTGCTCCGGC GTGCGCTTCC GTGGGCGCGC ATCCGGGGTG TCACCCATCC ACCCTACGAT
GTGTTCAGTC GCACATCGCT ACTGCGCGTC GAGGCGCGCG CCCGCCTTGG GTTGGATCCG
GATGAACCGG TGCTGCTCTT CTTTGGCTTC GTGCGACGCT ACAAAGGGTT GCGTCACCTG
ATCCAGGCGC TGCCGCTGAT ACGACAGCAT GTTCCGGTGC GTTTGCTGGT CGTTGGCGAG
TTCTGGGAAG ATGACCGCCC CTACCGCGAA CTGGTGCGGA ACCTGAATCT TGGTGATGTG
GTGCATTTCC ACAGCGAATA TGTCCCCAAC GAGCAGATTG CAGTCTACTT CTCCGCCTGC
GATGCGGTCG TTTTGCCCTA TCTGGAAGCG ACCCAGAGCG GCGTGGCGCA ACTGGCGATC
GGGTTCGAGA AGCCGATGAT CGCCACCTCT GTCGGCGGTA TGCCCGAAAC GATTCATAAC
GGTGAAACCG GGTTGATCGT GCCTCCCGGT GACAGTGCAG CGCTGGCGGA TGCAGTGGTG
CGCTTTTTTC GCAATGGACT GGCTGAGCCG TTCACCCGGA ATATCCGCAC GGTGCGCGAG
CGCGACTCCT GGCTGCCGCT GGTGCATCTG ATCGAGGAAC TGGCAGAACC GGCGAGTGCG
CCGCAAACTG AACAACCTGC GCCGCAAACA GCGTCGCCAA GAGTGCTGTA G
 
Protein sequence
MKITLIGPTY PFRGGIAHYT TLLTHHLRQR HDVRLISYIK QYPKWLYPGN TAMDPSPDES 
ALRVECDRVL TPMNPLTWWR AFRMIQRDNP DLLLLQWWTP FWSPMLFVLT RLVRRYTNVR
ILFLCHHVIA PDGGMFDWYL ARRILWRGHA FIVMSEEDFA LLRRALPWAR IRGVTHPPYD
VFSRTSLLRV EARARLGLDP DEPVLLFFGF VRRYKGLRHL IQALPLIRQH VPVRLLVVGE
FWEDDRPYRE LVRNLNLGDV VHFHSEYVPN EQIAVYFSAC DAVVLPYLEA TQSGVAQLAI
GFEKPMIATS VGGMPETIHN GETGLIVPPG DSAALADAVV RFFRNGLAEP FTRNIRTVRE
RDSWLPLVHL IEELAEPASA PQTEQPAPQT ASPRVL