Gene RoseRS_3120 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_3120 
Symbol 
ID5210089 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp3919976 
End bp3921076 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content65% 
IMG OID640596712 
Productglycosyl transferase, group 1 
Protein accessionYP_001277433 
Protein GI148657228 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0017817 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000176244 
Fosmid HitchhikerNo 
Fosmid clonabilityunclonable 
 

Sequence

Gene sequence
ATGCGCATTC TGATGCTCTC CAAAGCGCTG GTCAACGGCG CGTACCAGAA GAAGTGCGAA 
GAACTGGCTG CCCTGCCCGA TGTCGAGTTG ATCGTCGCCG TGCCGCCCGC CTGGCGTGAA
CCGCGCGTCG GCGTTATCCG GCTCGAACGA CGGTTTACCG CCGGCTATCA ACTGGTCACG
CTGCCGATGA TGTTCAACGG CCGCCACCAC CTGCATTTCT ACCCGACCTT CGAGCGCCTG
GTGCGCCGGA CGCGCCCCGA TATTGTGCAC GTCGATGAGG AGTCGTTCAA TCTGGCGACG
TTTCTGGCGC TGCGGGCGGG AGTACGGCAC GGGGCGCGCT GCTGCTTCTA CAACTACGCC
AACATCGACC GATTCTATCC GCCGCCCTTC AACCTGTTCG AGCGCTATGC CTTTCGCCAC
GCAGCGCATG CCTTTGCATG CAGCACCGAA GCCGCCGCAA TCATCCGGCG CCACGGCTAC
ACCGGACCGC TCACCATTTT GCCGCAGTTC GGCGTCGATC CCGACCTGTA CGCGCCTGCG
CGGCGCGACC GCCGTAACGC CACGCTGGTC GTCGGCTACA TCGGGCGTCT CGTGCCGGAA
AAAGGGGTGA TCGACCTGGT GGAAGCCGTC GCGCGGGCGC CGTCGGTGCG CTTGCGGTTG
ATCGGCGATG GCGCACTGCG CCCGGCAATC GAGGCGCGGA TCGCCGCACT GGGCATCGGC
GAGCGTGTCG AACTGCACCC TGCCGTCCCA TCGACCCGCG TTCCCGACGA ATTGCAGCGA
CTCGACGCGC TGGTGTTGCC ATCGCGCACC ACGCGCACCT GGAAAGAACA GTTCGGGCGT
ATCCTGGTCG AAGCGATGAG TTGCGCCGTT CCGGTGGTCG GCTCTTCATC GGCAGCCATT
CCCGATGTCA TCGGTGATGC GGGAATCATC TACCCGGAAG GGGAAATTGA CGCACTGGCG
GATGTGCTGC GGCGTCTGGC GGATGATCCG GCGCTGCGCG ACGACCTTGG GCGACGCGGA
CGGGAGCGCG TGCTGGCGCA GTTCACCCAG GCGGCAATCG CCCGACAGTA CCATCACGCA
TACCGTTCGA TGCTGAATTG A
 
Protein sequence
MRILMLSKAL VNGAYQKKCE ELAALPDVEL IVAVPPAWRE PRVGVIRLER RFTAGYQLVT 
LPMMFNGRHH LHFYPTFERL VRRTRPDIVH VDEESFNLAT FLALRAGVRH GARCCFYNYA
NIDRFYPPPF NLFERYAFRH AAHAFACSTE AAAIIRRHGY TGPLTILPQF GVDPDLYAPA
RRDRRNATLV VGYIGRLVPE KGVIDLVEAV ARAPSVRLRL IGDGALRPAI EARIAALGIG
ERVELHPAVP STRVPDELQR LDALVLPSRT TRTWKEQFGR ILVEAMSCAV PVVGSSSAAI
PDVIGDAGII YPEGEIDALA DVLRRLADDP ALRDDLGRRG RERVLAQFTQ AAIARQYHHA
YRSMLN