Gene Rcas_3103 details

Gene Information

Locus tag: Rcas_3103
Symbol:
ID: 5540599
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 4020824
End bp: 4022320
Gene Length: 1497 bp
Protein Length: 498 aa
Translation table: 11
GC content: 60%
IMG OID: 640895222
Product: O-antigen polymerase
Protein accession: YP_001433175
Protein GI: 156743046
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3307] Lipid A core - O-antigen ligase and related enzymes
TIGRFAM ID:
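
The coordinate and length fields above are mutually consistent, which a short calculation can confirm. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length_bp = gene_length(4020824, 4022320)
print(length_bp)                  # 1497
print(protein_length(length_bp))  # 498
```

Both values match the Gene Length and Protein Length fields of this record.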


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.313581
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 38
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTACCTGG AACGTCTTCT CAAAAATCCG ATGGACGTTA CTGCCGGTCT CTGGCGACAC 
GTCTGGTTTC AGGTCGGCGT GGTGGTGGCT GTTTCCATCG CCGCCGGAAT GCTGGTGTCG
CGCGACGTGC CGCTCTGGTT GTTCTTCGGC GGCATCGCGG GGATTGTTGT CGTTGCAATT
GCCTTTATCA AACCGGAGTA TGTCGCGGCG GCGCTGCTGG TGATCCACTG GGGAAACATC
CACGACGTGC TGATCAAGTA TCACGGCATT CCTTCGGTTG TGAAACTGAT GGTTGCCCTG
CTCACCGTGG TATTGCTGGC GCGGCGATTT CTTTCCGAGC GACCGCGCGG TCTGGTGTCC
GACCCGGTGA TCTGGTGGAT GCTGGCATAC CTGGTGGTCG GAGCGACAGG TCTGTGGTTT
GCGCGCGATA CCGACGCGGT GCTGAGTCGT CTCGTTGATA CGGCGAAGGA CGGTGTCATT
GCCGGGTTGA TCTTCAACCT GCTTTCGACA CGCGCAGCAT TCGAGCGCGC GGTGTGGGGT
TTGTTGATCG TGGGTGCCGT GTTGAGTGCG TTGACGGTCT ATCAGGAGAT GACCAAAACC
TACGACAACA ACTACTGGGG ATTCGCGCAG GCGGCGGTGC GCCAGATTTC GACGACCATG
GACGACCGCG CGCGTGCGTT TGGCACAGTG AACGATCCAA ACTATTTCGG GCAGTTGTTG
CTCGTGCTCG TGCCGCTGGC GGTGTGGGCC ATTCTGAACG GGCGGACCTG GCGAGGGAAA
TCGTTTGGGA TGGCGGCGTT GTTGCTGTTG CTCGCAGCGA TTGGGTTGAC CTTCTCGCGC
GGCGCGTATC TCGGTGCTGT GGTCGTGCTG GTCGTCTATG CGATGTACCT GCGACTCGAT
GCGCGTTATT TGCTGATCCT GCCGCTGATC GGCGCGCTGC TCTATGTTGC GCCGCCGGAG
TTCCGCGCGC GCTTTGGCAC GCTCGATGAA GTGTTGCCGG GCAACAATGC GGGCGCCTAT
GCCGATAGTT CGATTCAGGG GCGCTCGGTC AAGGCGGAGG TTGCGATTGC CATAGTGGCC
GACAATCCGA TCTTCGGCGT CGGGCGCGGA AACTATCGGT TGCACTACCG CGACTATATT
AACGAAATCG AGGGAGCCGG CTCGAATACT GAACGTGATG CGCATAATCT ATACCTGGAA
GTCGCTGCGG AACAGGGGAT TGTTGGTCTG GTCGTCTTCG TTGGGTTGCT GGCAACTGTA
TGGGGACGCT TGCGCGCAGC CGAACTCCTG TTCGTAGCGG CAGGCGAACG TCGTATGGCG
GACCTCTCGG TTGCGGTCAA GGTTGGGTTG CTGGGGTATC TGGTGACGTC GCTCTTTCTG
CACGGCGCGT ATGGCTATAT GCTCTGGCTC CAGGTCGGCA TGGCGGTTGC GCTGGTTGTT
ATTGCACAGC GAGAAGCAGC AGCGCACGCG GATCATGCGG TGAAAGCGAG CCGGTAA
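
The gene sequence is printed in space-separated blocks of ten bases, so it needs to be joined before its length or GC content can be compared with the Gene Length and GC content fields. The following is a minimal sketch of that step. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first line of the sequence is used here as sample input.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a formatted sequence block."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an unformatted DNA string."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence above, used as sample input.
first_line = "ATGTACCTGG AACGTCTTCT CAAAAATCCG ATGGACGTTA CTGCCGGTCT CTGGCGACAC"
seq = clean_sequence(first_line)
print(len(seq))                    # 60 bases on this line
print(round(gc_percent(seq), 1))   # GC percentage of this line only
```

Passing the full sequence block to the same two functions gives the values to compare against the record's 1497 bp and 60% fields.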
 
Protein sequence
MYLERLLKNP MDVTAGLWRH VWFQVGVVVA VSIAAGMLVS RDVPLWLFFG GIAGIVVVAI 
AFIKPEYVAA ALLVIHWGNI HDVLIKYHGI PSVVKLMVAL LTVVLLARRF LSERPRGLVS
DPVIWWMLAY LVVGATGLWF ARDTDAVLSR LVDTAKDGVI AGLIFNLLST RAAFERAVWG
LLIVGAVLSA LTVYQEMTKT YDNNYWGFAQ AAVRQISTTM DDRARAFGTV NDPNYFGQLL
LVLVPLAVWA ILNGRTWRGK SFGMAALLLL LAAIGLTFSR GAYLGAVVVL VVYAMYLRLD
ARYLLILPLI GALLYVAPPE FRARFGTLDE VLPGNNAGAY ADSSIQGRSV KAEVAIAIVA
DNPIFGVGRG NYRLHYRDYI NEIEGAGSNT ERDAHNLYLE VAAEQGIVGL VVFVGLLATV
WGRLRAAELL FVAAGERRMA DLSVAVKVGL LGYLVTSLFL HGAYGYMLWL QVGMAVALVV
IAQREAAAHA DHAVKASR
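
The protein sequence can be related back to the gene sequence by translating it codon by codon. The sketch below builds the codon table from the compact amino acid string that NCBI lists for translation table 11, whose codon assignments are the same as the standard code. It is an illustration only: it does not handle the alternative start codons that table 11 permits (this gene starts with ATG) or ambiguous bases such as N, and the `translate` helper is a hypothetical name.

```python
# Codon table in TCAG order; the amino acid string is the one NCBI lists
# for translation table 11 (identical assignments to the standard code).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    residues = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence in this record.
print(translate("ATGTACCTGG AACGTCTTCT CAAAAATCCG"))  # MYLERLLKNP
```

The printed residues match the first ten residues of the protein sequence above.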