Gene RoseRS_3654 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_3654 
Symbol 
ID5210632 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp4570046 
End bp4572016 
Gene Length1971 bp 
Protein Length656 aa 
Translation table11 
GC content60% 
IMG OID640597247 
Productpolysaccharide biosynthesis protein CapD 
Protein accessionYP_001277959 
Protein GI148657754 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1086] Predicted nucleoside-diphosphate sugar epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.772925 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCACGAT CATTGCCTGC CGGACATCTC TGCACAATCG ATGTGTTGAT GTTGACTATT 
GCGGCGTTCG CCAGTTATGC ACTACGTCTC GAGCGTCTCG ATCTGGGTGA GCACTGGCGG
TCGTTCGTGC TGTTTGCCGG AGCAGCGCCG GTTGTTGTGC TGATGCTCTT TGGGATGACA
CGGGTCTATG CCCAGTACTG GCGCTATGCA TCGTTCCACG AGTTCAGTCT GCTGGCGTGG
GCGCTTGTGT GCGCCGGGAT CGTTCTGGAA GGCATGGTGC TGATCGGGCG CGCCCTGTTT
CCAGCGATAC CGGTTGTCCC GTTGTCGATC CCGCCGATCT TCGTCTTATC CGCGCTGACA
CTGACCGCAT TGCCGCGCCT GATGATGCAC GCGCGATTTC AACCTTCCCC CCGGCGCTGG
CATCGACAGA GCGGTAATCG TGCGCTGATC ATGGGCGCCG GTGAAGCCGG CGCAATGATT
GTGCAGAGTA TGCGTCTTGC CCGGCAGAAT AATGTTATTG TGGGCTTTGT CGATGATAAC
CCGCACAAGC GAGGCGTGCG CATCAATGGC GCGCCGGTGC TCGGTGATCG TCACGATATT
CCGCGTCTGG CGGCAGAGTA TCAGATCAAC GAGGTGATCA TCGCCATGCC GAGTGCGCCG
GGCAAAACCA TCCGTGAGAT TGTTGCGATC TGTGAACGTG CCGGTGTGCG CGCTCGCATC
ATCCCCGGAA TAGCCGAACT GGTCGATGGT CGGTTCAGCG TCAATCATAT CCGCGATGTG
CAGATCGAAG ACCTGCTCCG CCGCGCGCCG ATACAAACCG ATATGCAGGC GGTAGGGCGC
CTGATCCGTG GGCGGCGCGT GCTGGTCACC GGCGGGGGCG GATCGATCGG GAGCGAAATC
TGTCGCCACG TGCTGCGGTA CGAACCGTCT GACCTGATCA TTCTGGGGCA CGGTGAAAAC
AGTGTATTCG CCATCCACAA CGAGTTGTAC CGGTGGTTGA ACACGCCACG CGGAGAGTCG
GATAGCGTAG ACGGTGATGG ACAATGCCGG TCATACCGCA CGCCAACGCT GCATACGGTG
ATTGCCGATA TTCGCTTCTC CGAGCGCATT CACGCGGTGT TCGAGCGGTA TCGTCCGGAG
ATCGTGTTCC ATGCCGCAGC GCACAAGCAC GTTCCGCTGA TGGAAGCCAA CCCCGTCGAA
GCGGTGACCA ACAATGTGCT TGGCACGCGC AATCTGCTCG ATGCCGCAAT TGTCACCGGC
GTTGAACGCT TCGTCATGAT CTCGACCGAT AAGGCGGTCA ACCCCACCAG CATCATGGGC
AGCAGCAAGC GTGCCGCCGA ACTGCTGGTG CATCACGCAG CGAAGGTCAG CGGTCGGGCG
TTCATGGCAG TGCGTTTCGG CAACGTCCTG GGCAGTCGCG GCAGTGTTGT GTGGACGTTC
AAGCAGCAGA TTGCCGCCGG CGGACCGGTG ACAGTAACCC ATCCAGAGAT GCGCCGTTAT
TTCATGACCA TCCCCGAAGC GGTGCAACTG GTGTTGCAGG CGGCGGCGCT TGGTCGGGGC
GGCGAGGTGT TTACGCTGGA CATGGGTGAG CCGGTCAAGA TTCTCGATCT GGCGCGCGAT
ATGATCGAAC TCTCCGGGTT GCAGGTAGGG CGCGACATCG ATATCGCCTT TGTAGGGTTG
CGCCCAGGCG AGAAACTCTT TGAGGAACTG TTCCTGCCCG GCGAGCAGTA CGACCGCACA
AGCCACGAGA AGATTTTCAT TGCCAGGAAT GCCAGCCGGC TTGTCCCCGC CGATGTGCTC
GCGCTGATCG CCGATCTTGA AGAGGCGGCT CTGTCGGACG ATACGTCACG CACCGTCCGG
TTGCTCCGTC TCATCGTTCA GCGCAGTCAA TCGACGCCAC ACGAGGATGT GCACGGCGAT
CATACGCTCG AGCCTGCCAG CCTGCGCGCG CTCGCAGTGG GTGGGTCGTA G
 
Protein sequence
MSRSLPAGHL CTIDVLMLTI AAFASYALRL ERLDLGEHWR SFVLFAGAAP VVVLMLFGMT 
RVYAQYWRYA SFHEFSLLAW ALVCAGIVLE GMVLIGRALF PAIPVVPLSI PPIFVLSALT
LTALPRLMMH ARFQPSPRRW HRQSGNRALI MGAGEAGAMI VQSMRLARQN NVIVGFVDDN
PHKRGVRING APVLGDRHDI PRLAAEYQIN EVIIAMPSAP GKTIREIVAI CERAGVRARI
IPGIAELVDG RFSVNHIRDV QIEDLLRRAP IQTDMQAVGR LIRGRRVLVT GGGGSIGSEI
CRHVLRYEPS DLIILGHGEN SVFAIHNELY RWLNTPRGES DSVDGDGQCR SYRTPTLHTV
IADIRFSERI HAVFERYRPE IVFHAAAHKH VPLMEANPVE AVTNNVLGTR NLLDAAIVTG
VERFVMISTD KAVNPTSIMG SSKRAAELLV HHAAKVSGRA FMAVRFGNVL GSRGSVVWTF
KQQIAAGGPV TVTHPEMRRY FMTIPEAVQL VLQAAALGRG GEVFTLDMGE PVKILDLARD
MIELSGLQVG RDIDIAFVGL RPGEKLFEEL FLPGEQYDRT SHEKIFIARN ASRLVPADVL
ALIADLEEAA LSDDTSRTVR LLRLIVQRSQ STPHEDVHGD HTLEPASLRA LAVGGS