Gene Rcas_3101 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_3101 
Symbol 
ID5540597 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp4018357 
End bp4019826 
Gene Length1470 bp 
Protein Length489 aa 
Translation table11 
GC content59% 
IMG OID640895220 
Productpolysaccharide biosynthesis protein 
Protein accessionYP_001433173 
Protein GI156743044 
COG category[R] General function prediction only 
COG ID[COG2244] Membrane protein involved in the export of O-antigen and teichoic acid 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0405344 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCAATCAA CGACAACATC GCAATCCGAA GGAGTCGGCA AGCGCGCCGT GCGTGGAACG 
TTCTGGTCAT TCCTTTCGTA CACGAGCGGG CGCCTCGTCA CGTTTGTCAC TACGCTGATC
CTGGCGCGTC TGCTGGCGCC GGCGGAGTTC GGCGTGATCG CTTACTGCAC ACTGGTGATC
GCCTATCTCG ACCTGCTTAA TAATTTTGGC GTCGGGCATG CCCTGATTGC GCGACGTGAC
AGGTTGGAAG AAGCCCAGAA TGCGGCATTC GTCGTCAGTA TTGGCAGTAG CGTGTTTCTG
TACGCCGGCG CGTGGATTGC GGCGCCCTCG ATTGCAGTGT TCTTCAATGA GCCGCAGGTA
ACGCCGTTGC TGCGCGTGTT GTCGCTTGGG CTGCTGCTGG TCGGAATCGG CACTGTGCCG
ATGGCTATGC TTCAGCGCGA TCTCCGGTTC AAGGCGTATT TACTTCCCGG GATTGTGCGG
AATATTATCA AAGCAGTGGT TGCCATCAGC ATGGCGTGGC AAGGGTTTGG CGTCTGGAGT
CTGGTGGTTT CAGAACTGGT CAACAAGGTG TTGGAGGTGA TCATTCCCTG GCTGATTGTG
CGCTGGCGAC CAACGCGTGC GTTCGACCCG CAGGTGATGC GCGAGATGTT GGGGTATGGC
GTCCACATTA TGGGGGTCAG TCTGGTTGGC TCCTTTATGG TCAATGTGGA TTATCTGCTG
GTCGGGCGGT TGCTTGGCGC GGCGGCGCTG GGGTACTATA CAATGGCGTT CCGCATTCCC
GAACTGGTCA TTCGCAGCGT CAGTCAGATC GTCAGCACCG TCGCCTTTCC TGTTCTGGCG
CATACCCAAT CGGATCCGGC AAAGACGCAC GACATGTATT TCGCCTATCT GCGCTATATG
GCGCTGGTGA CCTTTCCCGC AGGCGTTGGG CTGGCGCTGT TGTCGCCGGC GCTGGTGCGG
GTCTTTTTTG CCGAGGTATG GCGTCCGATG ACGGCGCCAA TGCAGTTCAT CGCCATCGCC
AGCGCCTTTT CCATCGTGTC GTATCTGTCG GGGATCATTT ACAATGCGAT TGGGCGGCCT
GATCTGACTT TTAAATTGTC GCTGGCGAAA CTGCCGATTG TTGTGCTGGT GCTCTCCATC
GGCACGTTCT GGAATATTAC GGGCGTGGCT GCCGGACATG TCGCGCTGAC GCTGGTGTGT
ATGGCGCTCG ATTTGGTGAT GATCCGACGG GTGACCGGTG TGCGACTGAT GGGCGTGTGG
CATGCGGTGC AACCAGCGTT GTTGGGCGCA GGGGTGATGG CAGCCGTTGT TGGTGCGCTC
GACGCGATGC TGACGGGTGC GCCCATCGTG CAATTGGCGG CGCTGCCACC GATAGGCGCC
CTGGTCTATC TCGGAACTAT CTGGATCGCC GGACGTGAGA TGTTTCTGGA GGCGCGCTCG
GTGCTGCGCG GTAGTCTGGC GCGCGGTTGA
 
Protein sequence
MQSTTTSQSE GVGKRAVRGT FWSFLSYTSG RLVTFVTTLI LARLLAPAEF GVIAYCTLVI 
AYLDLLNNFG VGHALIARRD RLEEAQNAAF VVSIGSSVFL YAGAWIAAPS IAVFFNEPQV
TPLLRVLSLG LLLVGIGTVP MAMLQRDLRF KAYLLPGIVR NIIKAVVAIS MAWQGFGVWS
LVVSELVNKV LEVIIPWLIV RWRPTRAFDP QVMREMLGYG VHIMGVSLVG SFMVNVDYLL
VGRLLGAAAL GYYTMAFRIP ELVIRSVSQI VSTVAFPVLA HTQSDPAKTH DMYFAYLRYM
ALVTFPAGVG LALLSPALVR VFFAEVWRPM TAPMQFIAIA SAFSIVSYLS GIIYNAIGRP
DLTFKLSLAK LPIVVLVLSI GTFWNITGVA AGHVALTLVC MALDLVMIRR VTGVRLMGVW
HAVQPALLGA GVMAAVVGAL DAMLTGAPIV QLAALPPIGA LVYLGTIWIA GREMFLEARS
VLRGSLARG