Gene Rcas_0465 details


Gene Information

Locus tag: Rcas_0465
Symbol:
ID: 5537928
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 592971
End bp: 594431
Gene Length: 1461 bp
Protein Length: 486 aa
Translation table: 11
GC content: 51%
IMG OID: 640892628
Product: O-antigen polymerase
Protein accession: YP_001430614
Protein GI: 156740485
COG category:
COG ID:
TIGRFAM ID:
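
The coordinate and length fields in this record are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch in Python, assuming the start and end coordinates are 1-based and inclusive and that the gene length counts the terminal stop codon. The function names are illustrative and do not come from the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length(592971, 594431)
print(length, protein_length(length))  # prints: 1461 486
```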


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.216322
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.610525
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTGCAA CGTTTATGCG CACAATGCTC AACCAAAAGA GGCGTCACCT GAAGAGCAGC 
CTGACAGCAC GACAACCCCT TCCTGTATTT CGTTCCTTCC TGATAGTCGG CGTGTTGCTC
GGCATTGTCG GATTGAGCCT GGCGGCTGGG TACGTCATCG GCGCCGATCG GAAATTCGTC
GCGCTGGCGA TCTTCGGTCC GCTGGCGGCG ATACCGGTTT TCTTTCTTGT CAGTCGGCAT
TTTACCTATG CGATACTGTC TTTGCCCATA GCAGCAATGG CGATCCCGAT TGATATACCG
ACGGGAACGT ATACGAAAAT ACCGATCTCA CTGGTTATTG CAACAATGCT CTGCGGTATA
TGGATCACGT CGATGGCGAT CCGCAACAAT TGGCGACTGG CGCCGTCGCC GATTAACCGT
CCAATGATCG TATTCTGCAT TGCCTGTACG ATATCGTTGA TATGGGGAAT TGTATGGCGT
GATCCTATAC TACGGATGGA TATATTTTCG AACTTCATCG TTGTTCAGAT CGCTTCTCTC
GTTACTTATG CAGTTTCTGT CGGGGTTGCG CTGTTGATCG GTAATTTTCT TTGGAATGAG
GGTCAAATCA AGTATCTTAT CGGTTGCTTT TTGTTTTTTG GGTCGCTCAT GACCATATTT
CAAATCTTGA GGATCGATCA TAGAATCCTT ACAGACCGTG GATTATGGGG ATTATGGACG
GTTATTCCTG CTTATGCGCT GCTGATTACG CAACCCGGCT TGCGCTTGCG TTGGAGATTG
CTTCTGCTGG CGCTCATTGT CGCTAATCTG TATCAAACGA TCCTGATCAA TCTGCTCTGG
AAATCGGGAT GGATTCCGAC GGTTATTGCT ATCTTTGCAG CGACATTGAT CCGTTCACGG
CGTTGGTTCG TTGTGCTGGC CGTTGCAGTG ATTGTGTTGG TATACACACA ACAAGATTTT
TTCAATCAGA TGATCGAAAC AGAGTTGAAC GAAGGCGCCG ATGGTCGGAT CGGTATGTGG
GAGATCAATC TGCGCGTGGT TGGCGAACAT TGGCTGTTCG GCACCGGTCC TGCCGGGTAT
GCGCCGTACT ATATGACCTA CTACCCCTAC GATGCGCGCT CGACGCACAA CAACTATCTC
GACATCATTG CTCAGTTTGG CGTGGTTGGT TCGATAATCT GGCTCTGGTT CGCGTTTGCC
AGCACAAGCG AAGGTTTGCG CCTCTACCGT GAAGCGCCGC CTGGATTCCT CAAAACCGCA
GCATTGACCA CGGTCAGCGG TTGGATCGGC GCGCAGGCAT CGATGTTCTT TGGTGACTGG
ATTCTGCCGT TTGCCTACAA CCAGACGATT AACGGCTTCA AATACACGGT TTATAGCTGG
TTTTTCGTCG GTCTCCTGAT CAGCCTCCGG CAGATCATTG AGCGGCGCAA AGCGACCCAA
ACGGTGAGCA ATAGCGTATG A
 
Protein sequence
MSATFMRTML NQKRRHLKSS LTARQPLPVF RSFLIVGVLL GIVGLSLAAG YVIGADRKFV 
ALAIFGPLAA IPVFFLVSRH FTYAILSLPI AAMAIPIDIP TGTYTKIPIS LVIATMLCGI
WITSMAIRNN WRLAPSPINR PMIVFCIACT ISLIWGIVWR DPILRMDIFS NFIVVQIASL
VTYAVSVGVA LLIGNFLWNE GQIKYLIGCF LFFGSLMTIF QILRIDHRIL TDRGLWGLWT
VIPAYALLIT QPGLRLRWRL LLLALIVANL YQTILINLLW KSGWIPTVIA IFAATLIRSR
RWFVVLAVAV IVLVYTQQDF FNQMIETELN EGADGRIGMW EINLRVVGEH WLFGTGPAGY
APYYMTYYPY DARSTHNNYL DIIAQFGVVG SIIWLWFAFA STSEGLRLYR EAPPGFLKTA
ALTTVSGWIG AQASMFFGDW ILPFAYNQTI NGFKYTVYSW FFVGLLISLR QIIERRKATQ
TVSNSV
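
The gene and protein sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any computation. The sketch below, in Python, shows one way to strip that layout, compute a GC fraction, and translate the coding sequence. It assumes the standard amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in its permitted start codons, which does not matter here because this gene starts with ATG). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and ambiguous bases such as N are not handled.

```python
from itertools import product

# Codon order T, C, A, G at each position, matching the NCBI layout of the
# amino-acid string for the standard code (shared by translation table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq) if seq else 0.0


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# The first three blocks of the gene sequence give the first ten residues.
print(translate("ATGTCTGCAA CGTTTATGCG CACAATGCTC"))  # prints: MSATFMRTML
```

Passing the full gene sequence to `translate` and comparing the result with the cleaned protein sequence is a quick consistency check on the record.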