Gene Rcas_0733 details


Gene Information

Locus tag: Rcas_0733
Symbol:
ID: 5538198
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 957451
End bp: 958908
Gene Length: 1458 bp
Protein Length: 485 aa
Translation table: 11
GC content: 61%
IMG OID: 640892888
Product: O-antigen polymerase
Protein accession: YP_001430872
Protein GI: 156740743
COG category:
COG ID:
TIGRFAM ID:
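
The start, end, gene length, and protein length fields above are mutually consistent, and a few lines of arithmetic can confirm this. The sketch below assumes 1-based, inclusive coordinates and assumes the final codon is a stop codon that is not counted in the protein length. The function names are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length(957451, 958908)
print(length)                  # 1458
print(protein_length(length))  # 485
```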


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.480414
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 37
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTCTTT TATCACATTC TCCCCGTGAT GACCTGGCGA TAACGCTGGG ACGCAGCCCG 
CTGATAGCGG CGCTGGCGGC GGTGCTGCTC GGTGTGATTG GCGGCGCTGG AGTGGCGTTC
GGTCCGTTCT GGCTGGGTTT TGCGGCGCTG GCGGCGCTCA TGGGCGTCTA TGCGCTGCTG
ATCGACACCC GCGTCGGACT GGCGGTGGTG ATCGGCATTG CCACCATCAT TCCGTTTGCC
ACATTGCCCT TTCGCGCTGT AGTGACCCCG ACGCTCCTGA CACTGGCGCT GGCGGCGCTT
ATGGGGGTCT GGTCGCTACG GTTGCTGGTG CGCCGCGACG AGCGCCTGAT CGTCACGCCG
ATTGGTTTGG CGATCATCGG TTTTTTGGGG ATTACGCTGT TTGCCTTTCT ACTCGGTTCC
AATGCCAGTC CTGAACCCTC GCTGATCCAC AACTATGTCA AATTTGTGAT GGCGACGCTC
TTCTTCTTCA GCGTTGTTAA TTGTGTGCGC GACCGTGAGA CGGTGCGCTG GGTCATCCGT
TTTCTGATCA TCGGTGCTGC GCTGTCCGCA TTGATAGCCC TGGTTCTGTA CGTTATTCCC
GATCAACTGG CGCTTCAGAT TCTGGTATCG TTTGGGCGTA TCGGCTATCC GACCGAAGGG
CGTGTGCTGC GGTATGTTGA GGATGATCCC AACGGCTTGA TGCGCGCCAT CGGTCTCTCG
GTTGATCCCA ACAGTTTTGG CGGGATGCTG GCATTGATCG GCGCCCTGGC AGCGACTCAG
GCGGTGAGCG AACGTCCGGC GCTGCCGCGA CGGTTGTTGC TGATCGCTAC CGGCGCAATC
CTGCTGGCGT TGTTCCTGAC CTACTCGCGC GCAGCGCTTG GCGGCATGAT CGTTGCGGCG
ATGTATGTGG CGACACTGCG CTATCGGCGG CTCTGGTGGG TCATCCTGGC GGTTGGAGCG
CTGGCTGCCG CGCTCTTCAT CGGGCTTGGC GTCGGGGAAC GTTTCGTTGA GCGCGTGGTT
GAGGGGGTGC AGTTCCGCGA CCGGGCGAAC CAGATGCGCC TGGCGGAGTA CCAAAACGCA
ATTGCGATCA TTCAGGCATA TCCGGTATTC GGCATTGGTT TTGGTCAGGC GCCGGAGATC
GATCTGGTGG CGGGAGTGTC GAGCATCTAC CTGGCGATTG CACAACGCAC CGGTCTGGTC
GGTCTGACAG CGTTCCTGAG CATCATAGCC TGGTTTTTCG CGCGCAACTG GAGTGTGCTG
CGCGCTGCGG CACGCTCAGG CGATGAAGAG CGCGCTGCGT GGCTGGTGGC GCTCCAGGCG
GCGCTGGCGG CGGCATTGGC GGTGGGTTTG CTCGATCACT ATTTCTTCAA TATCGAGTTC
AGCCACATGA GCGCGCTGTT GTGGGGCACA GTCGGTCTGG CAGTGGCGAT TGAAGGATTG
GAAGAGGAGA CAGATTGA
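
The GC content reported for this gene can in principle be recomputed from the sequence above. The following is a minimal sketch that assumes the sequence is pasted as text with the 10-base blocks separated by whitespace, as shown here. The helper name `gc_percent` is hypothetical, and only the first row of the sequence is used as sample input.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases, ignoring whitespace between blocks."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First row of the gene sequence; paste all rows to get the whole-gene value.
first_row = "ATGAGTCTTT TATCACATTC TCCCCGTGAT GACCTGGCGA TAACGCTGGG ACGCAGCCCG"
print(round(gc_percent(first_row), 1))
```

Running it on the complete sequence and rounding to a whole percent should be comparable with the GC content field in the gene information.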
 
Protein sequence
MSLLSHSPRD DLAITLGRSP LIAALAAVLL GVIGGAGVAF GPFWLGFAAL AALMGVYALL 
IDTRVGLAVV IGIATIIPFA TLPFRAVVTP TLLTLALAAL MGVWSLRLLV RRDERLIVTP
IGLAIIGFLG ITLFAFLLGS NASPEPSLIH NYVKFVMATL FFFSVVNCVR DRETVRWVIR
FLIIGAALSA LIALVLYVIP DQLALQILVS FGRIGYPTEG RVLRYVEDDP NGLMRAIGLS
VDPNSFGGML ALIGALAATQ AVSERPALPR RLLLIATGAI LLALFLTYSR AALGGMIVAA
MYVATLRYRR LWWVILAVGA LAAALFIGLG VGERFVERVV EGVQFRDRAN QMRLAEYQNA
IAIIQAYPVF GIGFGQAPEI DLVAGVSSIY LAIAQRTGLV GLTAFLSIIA WFFARNWSVL
RAAARSGDEE RAAWLVALQA ALAAALAVGL LDHYFFNIEF SHMSALLWGT VGLAVAIEGL
EEETD
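
The protein sequence can be checked against the gene sequence by translating codon by codon. The sketch below is one way to do that under stated assumptions: it uses the amino acid assignments of the standard genetic code, which translation table 11 shares, and stops at the first stop codon. The names `CODON_TABLE` and `translate` are illustrative and not taken from the record.

```python
from itertools import product

# Codons are enumerated in T, C, A, G order, with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    bases = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGAGTCTTT TATCACATTC TCCCCGTGAT"))  # MSLLSHSPRD
print(translate("GAAGAGGAGA CAGATTGA"))               # EEETD
```

Table 11 differs from the standard code in which codons may serve as start codons. This gene begins with ATG, so the sketch does not special-case the first codon.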