Gene Rcas_2975 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_2975 
Symbol 
ID5540467 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp3858579 
End bp3860228 
Gene Length1650 bp 
Protein Length549 aa 
Translation table11 
GC content62% 
IMG OID640895093 
ProductO-antigen polymerase 
Protein accessionYP_001433050 
Protein GI156742921 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.024758 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000748304 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGAAACGAA CGTGGGGCAG TGTTCCTTTC CTGAGACCGT TGATATTGTT CGTCATTGGC 
GGCGTGCTGG GAACGCTTGT CGCCTATGAT CCTGTCGTAA GTCTGGCGTG GCTGGCGCCC
ATGATCGCAG GAGCGGCATT GTACCTCAGT ATGATGACCG TCCTGCGCCA TCGATTGGCG
ATTATTGCTC TGGTTCTGGC GTTCTGGAGC ATCGGTTACA GCGTCTTGCT GGCGACGCAG
TATCGGTATC TGGGGTTCGA TGAGAAACTG GGGCTCGCCA TATGGCTTGG ACGCCTGTTC
AGTTCGCCTT TCCCGGATGT GACGCCGGCG TTCATCGATG CCAATGCTGC GGCATCGTTC
CTGGCGCCGG CGATACCGCT TATCATCGGG CTGGCATGGA CCGCGCGCGG CGTCTGTCGC
GTAGCATGGG GCATTGCCGC CGGTAGTGTT GCCTTTGGCG TGTTGCTCAC ATCTTCACGC
GGTGCGTTCG TTGCACTGGC GGCGGCAGGA CTCTTCTGGC TTCTGGTGCG CGTGCAGGCG
TCTGCACATC AATCTGGCGC CCATATGCCT CGCTTCGACC TGCGGAGCGC CATCGTTGCC
GGCGCCGTCA TAGCCGGGGT GGTCGCTGGC GGACTTCTGC TGGTCTGGCA TCCGCTGACG
CAGGACGCCC TCGCATCGGC GATGCTGCGC GCCGAGGATC GGCTGGCAGT CTATCGCAAT
AGCCTGTTTC TGGCGCTCGA TTTTCCGTTC AGCGGCATTG GACCGGGGGC AGTGTTCGGG
CAGATGTACT CGCGCTTTCA GTTGCTCATC ATTCCCACCT ACATCGGTTA TGCGCACAAT
CTGTTCCTCG GCGTCTGGCT GGCGCAGGGC ATCATCGGGC TGATCGGCTT TCTCTGGTTG
CTCATCGCGT CGTTGCATCG CATTGCGCCG ACGCTCCATA CGCAATCTCC GCTGACACAG
GGGGCGGCAA TCGGGTGCGT TGCGCTGCTG TTTCATGGGT TGACCGACGC GCCGCAGTAC
GCTACATCCT GGGCGACCCT GATCCTGGCA TTTGGACTCT TTGGCATGAC TGCCGCTACC
TGCCGTCCAA CAGAGGCGCT ATTGCTCGCT GTTGCGCCTG CAACAAAACG GCACAGCATT
TGTTCGTGGG TCGTCGCTAT TGCAGGAGTC ATTGGGCTGA CATTGAGCGC GCCCCATCTT
GCGGCTGCGG GTGCGGGCAA CATTGCCGCA GGGTTCCAGG CGCGCGCCAT GCTCGCCGAA
GGGTTGACGC AAGAGGAACG CGCCGCGTTG ATGCACGAGT CGGTCGTCTG GGTCAATCAT
GGGCTGCGCA TAGCGCCAGA TTCGCCGCTG ATCCAGAAGC GGCTAGGCAT GCTGGCGCTC
GATCTGGGGG ATTATCCACG CGCGATCAGT GCCCTCGAGC GCGCACAACC ATTGCTTGCC
GATGATCAGG CGGTATGCAA GGCGCTTGGC ATGGCGTATG TGTGGACCGG CGATCCCGAC
CATGGCGCCG AAATCCTGGC GCACCTCGAC TATGCCGATG AGGTGCGCGA AGAACTGGGC
ATCTGGGTGT ATGCCTGGCA GGAGCGGGGG CGCGACGATC TCGCCGCTTA TGCGCAACGC
GCCGCGCAGG CAATGGCGGC AATTCACTGA
 
Protein sequence
MKRTWGSVPF LRPLILFVIG GVLGTLVAYD PVVSLAWLAP MIAGAALYLS MMTVLRHRLA 
IIALVLAFWS IGYSVLLATQ YRYLGFDEKL GLAIWLGRLF SSPFPDVTPA FIDANAAASF
LAPAIPLIIG LAWTARGVCR VAWGIAAGSV AFGVLLTSSR GAFVALAAAG LFWLLVRVQA
SAHQSGAHMP RFDLRSAIVA GAVIAGVVAG GLLLVWHPLT QDALASAMLR AEDRLAVYRN
SLFLALDFPF SGIGPGAVFG QMYSRFQLLI IPTYIGYAHN LFLGVWLAQG IIGLIGFLWL
LIASLHRIAP TLHTQSPLTQ GAAIGCVALL FHGLTDAPQY ATSWATLILA FGLFGMTAAT
CRPTEALLLA VAPATKRHSI CSWVVAIAGV IGLTLSAPHL AAAGAGNIAA GFQARAMLAE
GLTQEERAAL MHESVVWVNH GLRIAPDSPL IQKRLGMLAL DLGDYPRAIS ALERAQPLLA
DDQAVCKALG MAYVWTGDPD HGAEILAHLD YADEVREELG IWVYAWQERG RDDLAAYAQR
AAQAMAAIH