Gene RoseRS_4167 details


Gene Information

Locus tag: RoseRS_4167
Symbol:
ID: 5211151
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus sp. RS-1
Kingdom: Bacteria
Replicon accession: NC_009523
Strand:
Start bp: 5218431
End bp: 5220377
Gene Length: 1947 bp
Protein Length: 648 aa
Translation table: 11
GC content: 60%
IMG OID: 640597756
Product: polysaccharide biosynthesis protein CapD
Protein accession: YP_001278461
Protein GI: 148658256
COG category: [G] Carbohydrate transport and metabolism; [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1086] Predicted nucleoside-diphosphate sugar epimerases
TIGRFAM ID:
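
The length fields in the gene information can be cross-checked against the coordinates with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single untranslated stop codon (the record states neither explicitly); the variable names are illustrative only.

```python
# Values copied from the gene record.
start_bp = 5218431
end_bp = 5220377

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints: 1947 0 648
```

Under these assumptions the result matches the reported gene length of 1947 bp and protein length of 648 aa.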


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGATGCAGC GACTTCAAAA CCGGCATTTT CTCCTTTTCG ATATCCTGCT CGTGCCGCTT 
GCGATCTACC TGAGTTTCGT CCTGCGGCTG GAAATGTTCA ATCTCGGCAG TTACTGGCTG
GTATGCATGC AGTTCTGCCT GACGGCGGTC GTGACCACCC CCCTGGTGTT TCGCGCGCTG
GGGATCTACC GTCGCTACTG GCGCTACGCT TCGTTTGAAG AACTGCTGCT GCTCTGTAGT
GCAACGTCGA TTGCGCTGGC GCTTGCCACG CTTGTGTTTA CTCTGATCGA TGCGCTCTTG
CCGGTGGTCG CAACGATGCC GCGCTCCATT CCTTTTATCG TTCCGCCCAT CGCAGCCACG
CTTATCAGTG TGCCACGGTT GCTGGTGCGC ATCGGTGCAG CGCGCGAGCG TCGGCGTCGT
GCAACTGACC GACCGGCGCC GGTGTTGATC ATGGGCGCTG GCGATGCTGC GTCGATTATT
GTGCGTGAGA TTCAACGCAA TCCAAAACTC GGCATGGAGG TTGTCGGGCT GCTGGACGAC
GATCCGGCGA AGCGTGGGCT GCGGTTGCAC GGCGTCGAAG TGATGGGTGA CCGCCACGCT
ATTCCGACAC TGGTAGCCCG CCACAAGGTG CGTCAGGTAA TCATTGCGAT GCCAGGCGCG
CCTGGTAAGG CAGTACGCGA GATTATGCAT ATTTGCGAGT CTGTTGGTGT GACAGTGCGC
ATCATGCCCG GGGTTCACGA ACTGATCGAC GGAACGATCA GCGTCAGCAA ACTGCGCAAC
ATCCAGATTG AGGACCTGCT GCGCCGTGCG CCGGTGCAAA CCGATACCGC GGCAGTGCGC
GCGCTGATCG CCAACCGACG GGTCCTGGTA ACCGGCGGCG GCGGTTCCAT CGGCAGCGAA
CTGTGCCGCC AGTTGATCCG CTGCGGTCCA TCGCACCTGA TTGTGCTTGG TCACGGCGAA
AACAGTGTGT TCGAGATCTG CAACGAACTT CAGCGTCTGG CAGAAGCGCA CGCCGGTCAA
TCGCCGCACA TTGTGCCGGT GATCGCCGAT ATTCGTGATC TGGAACGCCT GCGCGCGGTG
TTCGAAATGC ATGCGCCGGA ACTCGTTTTT CACGCAGCCG CACACAAACA TGTTCCACTG
ATGGAGGAAC ATCCGGTCGA AGCCATCAGC AACAATGTCA TCGGCACGCG CAACCTGCTC
GACGTATCGC TCGAAACCGG CGTCGAACGG TTTGTGATGA TCTCATCGGA TAAGGCGGTC
AATCCGACGA GCGTGATGGG CGCAACCAAG CGCATTGCCG AGATGCTGGT GCTCAACGCT
GCGCGGATCA GCGGACGACC CTACGTGGCG GTGCGTTTTG GGAATGTGCT GGGCAGTCGT
GGCAGTGTCG TGCTGACCTT CAAACGGCAG ATTGCCGCCG GTGGACCGGT AACGGTCACG
CATCCGGAGA TGCGTCGCTA CTTCATGACC ATTCCAGAAG CGGTGCAACT GGTGCTCCAG
GCGTCGGTAC TGGGGCGCGC CGGCGAGATT TTTATGCTGG ACATGGGGGA ACCGGTGAAG
GTGGTCGATC TGGCGCGCGA CATGATCCGT CTGTCGGGAT TGGAGGTCGG GCGTGATATT
GATATCTGCT TCACCGGCAT ACGTCCGGGT GAGAAATTAT TTGAAGAATT GTTCGCCCAC
GGTGAAGAAT ATCAGCCAAC AGCGCACAGC AAAATCTTCA TCGCCGCTGG CGCCAGCAAC
AATATTCCGC CCGACTTGCG CACGGATGTA GCGCTGCTCG AACAGGTTGC GCGCGCGAAC
GACGATGCCG CCGCACGACG CATGCTGCGC CACATCGTCC CGGAGTACTG CCCGCCGTTG
CCTGCCCCGC CGATACCTGT CGCTGAAAAT ACGCCCTATC CTGTGCTGGT GCGTCCATTG
CAACCGCTGA TCGGGGGTGG ACGATGA
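
The gene sequence is displayed in space-separated blocks of ten bases, so it needs light cleaning before any computation such as GC content. The sketch below shows one way this might be done; `clean_sequence` and `gc_percent` are hypothetical helper names, not part of any database tooling, and the demo uses only the first display line, so its value is not expected to equal the whole-gene GC content reported in the record.

```python
def clean_sequence(raw: str) -> str:
    """Drop the spaces and line breaks used for display and upper-case the bases."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First display line of the gene sequence, used only as a short demo.
first_line = "ATGATGCAGC GACTTCAAAA CCGGCATTTT CTCCTTTTCG ATATCCTGCT CGTGCCGCTT"
fragment = clean_sequence(first_line)
print(len(fragment), round(gc_percent(fragment), 1))
```

Passing the full 1947-base sequence to `gc_percent` would give a figure comparable to the reported GC content.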
 
Protein sequence
MMQRLQNRHF LLFDILLVPL AIYLSFVLRL EMFNLGSYWL VCMQFCLTAV VTTPLVFRAL 
GIYRRYWRYA SFEELLLLCS ATSIALALAT LVFTLIDALL PVVATMPRSI PFIVPPIAAT
LISVPRLLVR IGAARERRRR ATDRPAPVLI MGAGDAASII VREIQRNPKL GMEVVGLLDD
DPAKRGLRLH GVEVMGDRHA IPTLVARHKV RQVIIAMPGA PGKAVREIMH ICESVGVTVR
IMPGVHELID GTISVSKLRN IQIEDLLRRA PVQTDTAAVR ALIANRRVLV TGGGGSIGSE
LCRQLIRCGP SHLIVLGHGE NSVFEICNEL QRLAEAHAGQ SPHIVPVIAD IRDLERLRAV
FEMHAPELVF HAAAHKHVPL MEEHPVEAIS NNVIGTRNLL DVSLETGVER FVMISSDKAV
NPTSVMGATK RIAEMLVLNA ARISGRPYVA VRFGNVLGSR GSVVLTFKRQ IAAGGPVTVT
HPEMRRYFMT IPEAVQLVLQ ASVLGRAGEI FMLDMGEPVK VVDLARDMIR LSGLEVGRDI
DICFTGIRPG EKLFEELFAH GEEYQPTAHS KIFIAAGASN NIPPDLRTDV ALLEQVARAN
DDAAARRMLR HIVPEYCPPL PAPPIPVAEN TPYPVLVRPL QPLIGGGR
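
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The following is a minimal sketch of that translation, not the database's own method. It uses the standard amino acid assignments, which NCBI translation table 11 shares; table 11 differs in its set of permitted start codons, which does not matter here because the gene starts with ATG. The names `BASES`, `AMINO_ACIDS`, and `translate` are illustrative.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered by first, second, third base in
# TCAG order. "*" marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)


def translate(raw: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon.

    Whitespace is ignored; any base other than A, C, G, T raises ValueError.
    Alternative start codons are NOT converted to M in this sketch.
    """
    seq = "".join(raw.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        b1, b2, b3 = (BASES.index(base) for base in seq[i:i + 3])
        aa = AMINO_ACIDS[16 * b1 + 4 * b2 + b3]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGATGCAGC GACTTCAAAA CCGGCATTTT"))  # first 30 bases: MMQRLQNRHF
print(translate("CAACCGCTGA TCGGGGGTGG ACGATGA"))     # last 27 bases: QPLIGGGR
```

Both fragments reproduce the start and end of the listed protein sequence, with the final TGA codon read as a stop.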