Gene RoseRS_2020 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_2020 
Symbol 
ID5208982 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp2508038 
End bp2509336 
Gene Length1299 bp 
Protein Length432 aa 
Translation table11 
GC content60% 
IMG OID640595627 
Productmonogalactosyldiacylglycerol synthase 
Protein accessionYP_001276356 
Protein GI148656151 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0707] UDP-N-acetylglucosamine:LPS N-acetylglucosamine transferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00457067 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCCCGATG TTTCACATGA AGCAGTGGCG CCCGGCGGCA AGCCGCGCAT TCTGTTTGCG 
ATCTCGGATA CCGGCGGCGG TCACCGCTCG GGAGCGCAGG CAATCGCCGC TGCGATTGAA
CAGCAGGTCG GCGAAGCAGT CGAAACATAT ATCATTGATA TCTTCGCTCA CACCGGCGTG
CCGGTGGTTC GGAACGCCCC GGTGGTCTAC GATAAACTCT CGACGCGCTG GCTCCCGCTC
TATGATGCCC TTTATCGCAT GACTGACGGT CGCCGGCGTA TCGATGCATT GACCGGCGTT
GTCTATCTGG CGGCGCATCG CAATATTTTG CGTGTGCTCG AAGCCGTGCG CCCGACGCTC
GTGGTTTCGG TGCATCCGTT GCTTAACCGT CTGGTTGGCA ATGCGCGCCG CACGTATCGC
CTCTCGTTTC GCTTTATCAC CGTTGTGACC GACCTGGTCA GCCTGCATGC ATCATGGGCC
GACCCGAGCG CGGAATTGTG CATTGTGCCG ACCGATGAGG CGTTCGAGCG TATGCTGCGT
CTGGGAATGC CGCCAGAGAA ACTGATGCGC ACCGGGTTCC CGGTGCATCC GAAGTTTGCG
GCATACCATC GGACGCGCGA TGAAGCGCAG ACGATCCTGG GCCTCTCGCC AGAGTTGTTT
ACCGTTCTGG TGACGAGCGG CGGGGTTGGA TCCGGCAATA TGGAGCAACT GGTGCGCAAT
ATCCATACTG CGTATCCACA GATCCAGTTG CTGGCAGTGA CCGGCAGAAA CAGTGCGTTG
CGTGAACGGC TCGAGAAGAG TGGTTTCGGT CCGAATGTGC ATATCTTCGG CTTCGTCACG
AATATGGAAG AGTTGATGGC GGCGAGCGAT ATTGTGATCT CGAAGGCGGG TCCGGGCACG
CTGATGGAAG CGCTGGTGAT GCGCCGTCCG GTGATTGTGA CGCAGGCGGT TGGGATGCAG
GAACGTGGCA ATATCGATTT TGTGCTGAAC CATGAACTGG GTCTGTTTTG CCCGACGATC
GATCGGATTG TGCCGGTGCT GGCAGAGTTG ATGGAACCGT CAACGTATGC AGCGACGGCT
GCGCGCCTGG TCGATGCCGT TCCGCGCGAT GGCGCGATGC AGATTGCATC TATTCTGCTT
GAACAGTTGC ACCTTGAGCC GCCGGTCCGT CGGTATCGAC GCTTCCGCCT CCCTTCTGTA
CGGATGCTGC GCCCGCGTGC GATTGTGCGG CGGCTGCGCC TGCCGCGGGT GAGAGGACTC
GTGCGCTGGC GGAGACCGTT GCCAGGGAGA CGAAGGTGA
 
Protein sequence
MPDVSHEAVA PGGKPRILFA ISDTGGGHRS GAQAIAAAIE QQVGEAVETY IIDIFAHTGV 
PVVRNAPVVY DKLSTRWLPL YDALYRMTDG RRRIDALTGV VYLAAHRNIL RVLEAVRPTL
VVSVHPLLNR LVGNARRTYR LSFRFITVVT DLVSLHASWA DPSAELCIVP TDEAFERMLR
LGMPPEKLMR TGFPVHPKFA AYHRTRDEAQ TILGLSPELF TVLVTSGGVG SGNMEQLVRN
IHTAYPQIQL LAVTGRNSAL RERLEKSGFG PNVHIFGFVT NMEELMAASD IVISKAGPGT
LMEALVMRRP VIVTQAVGMQ ERGNIDFVLN HELGLFCPTI DRIVPVLAEL MEPSTYAATA
ARLVDAVPRD GAMQIASILL EQLHLEPPVR RYRRFRLPSV RMLRPRAIVR RLRLPRVRGL
VRWRRPLPGR RR