Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rcas_2975 |
Symbol | |
ID | 5540467 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | + |
Start bp | 3858579 |
End bp | 3860228 |
Gene Length | 1650 bp |
Protein Length | 549 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640895093 |
Product | O-antigen polymerase |
Protein accession | YP_001433050 |
Protein GI | 156742921 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.024758 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.00000748304 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGAAACGAA CGTGGGGCAG TGTTCCTTTC CTGAGACCGT TGATATTGTT CGTCATTGGC GGCGTGCTGG GAACGCTTGT CGCCTATGAT CCTGTCGTAA GTCTGGCGTG GCTGGCGCCC ATGATCGCAG GAGCGGCATT GTACCTCAGT ATGATGACCG TCCTGCGCCA TCGATTGGCG ATTATTGCTC TGGTTCTGGC GTTCTGGAGC ATCGGTTACA GCGTCTTGCT GGCGACGCAG TATCGGTATC TGGGGTTCGA TGAGAAACTG GGGCTCGCCA TATGGCTTGG ACGCCTGTTC AGTTCGCCTT TCCCGGATGT GACGCCGGCG TTCATCGATG CCAATGCTGC GGCATCGTTC CTGGCGCCGG CGATACCGCT TATCATCGGG CTGGCATGGA CCGCGCGCGG CGTCTGTCGC GTAGCATGGG GCATTGCCGC CGGTAGTGTT GCCTTTGGCG TGTTGCTCAC ATCTTCACGC GGTGCGTTCG TTGCACTGGC GGCGGCAGGA CTCTTCTGGC TTCTGGTGCG CGTGCAGGCG TCTGCACATC AATCTGGCGC CCATATGCCT CGCTTCGACC TGCGGAGCGC CATCGTTGCC GGCGCCGTCA TAGCCGGGGT GGTCGCTGGC GGACTTCTGC TGGTCTGGCA TCCGCTGACG CAGGACGCCC TCGCATCGGC GATGCTGCGC GCCGAGGATC GGCTGGCAGT CTATCGCAAT AGCCTGTTTC TGGCGCTCGA TTTTCCGTTC AGCGGCATTG GACCGGGGGC AGTGTTCGGG CAGATGTACT CGCGCTTTCA GTTGCTCATC ATTCCCACCT ACATCGGTTA TGCGCACAAT CTGTTCCTCG GCGTCTGGCT GGCGCAGGGC ATCATCGGGC TGATCGGCTT TCTCTGGTTG CTCATCGCGT CGTTGCATCG CATTGCGCCG ACGCTCCATA CGCAATCTCC GCTGACACAG GGGGCGGCAA TCGGGTGCGT TGCGCTGCTG TTTCATGGGT TGACCGACGC GCCGCAGTAC GCTACATCCT GGGCGACCCT GATCCTGGCA TTTGGACTCT TTGGCATGAC TGCCGCTACC TGCCGTCCAA CAGAGGCGCT ATTGCTCGCT GTTGCGCCTG CAACAAAACG GCACAGCATT TGTTCGTGGG TCGTCGCTAT TGCAGGAGTC ATTGGGCTGA CATTGAGCGC GCCCCATCTT GCGGCTGCGG GTGCGGGCAA CATTGCCGCA GGGTTCCAGG CGCGCGCCAT GCTCGCCGAA GGGTTGACGC AAGAGGAACG CGCCGCGTTG ATGCACGAGT CGGTCGTCTG GGTCAATCAT GGGCTGCGCA TAGCGCCAGA TTCGCCGCTG ATCCAGAAGC GGCTAGGCAT GCTGGCGCTC GATCTGGGGG ATTATCCACG CGCGATCAGT GCCCTCGAGC GCGCACAACC ATTGCTTGCC GATGATCAGG CGGTATGCAA GGCGCTTGGC ATGGCGTATG TGTGGACCGG CGATCCCGAC CATGGCGCCG AAATCCTGGC GCACCTCGAC TATGCCGATG AGGTGCGCGA AGAACTGGGC ATCTGGGTGT ATGCCTGGCA GGAGCGGGGG CGCGACGATC TCGCCGCTTA TGCGCAACGC GCCGCGCAGG CAATGGCGGC AATTCACTGA
|
Protein sequence | MKRTWGSVPF LRPLILFVIG GVLGTLVAYD PVVSLAWLAP MIAGAALYLS MMTVLRHRLA IIALVLAFWS IGYSVLLATQ YRYLGFDEKL GLAIWLGRLF SSPFPDVTPA FIDANAAASF LAPAIPLIIG LAWTARGVCR VAWGIAAGSV AFGVLLTSSR GAFVALAAAG LFWLLVRVQA SAHQSGAHMP RFDLRSAIVA GAVIAGVVAG GLLLVWHPLT QDALASAMLR AEDRLAVYRN SLFLALDFPF SGIGPGAVFG QMYSRFQLLI IPTYIGYAHN LFLGVWLAQG IIGLIGFLWL LIASLHRIAP TLHTQSPLTQ GAAIGCVALL FHGLTDAPQY ATSWATLILA FGLFGMTAAT CRPTEALLLA VAPATKRHSI CSWVVAIAGV IGLTLSAPHL AAAGAGNIAA GFQARAMLAE GLTQEERAAL MHESVVWVNH GLRIAPDSPL IQKRLGMLAL DLGDYPRAIS ALERAQPLLA DDQAVCKALG MAYVWTGDPD HGAEILAHLD YADEVREELG IWVYAWQERG RDDLAAYAQR AAQAMAAIH
|
| |