Gene Information |
Locus tag | RoseRS_4235 |
Symbol | |
ID | 5211220 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | + |
Start bp | 5305909 |
End bp | 5307333 |
Gene Length | 1425 bp |
Protein Length | 474 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 640597824 |
Product | O-antigen polymerase |
Protein accession | YP_001278528 |
Protein GI | 148658323 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
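The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based and inclusive and that the CDS length includes the terminal stop codon; the function names are hypothetical and chosen only for illustration.

```python
# Values copied from the Gene Information table above.
start_bp = 5305909
end_bp = 5307333
gene_length_bp = 1425
protein_length_aa = 474


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by an unspliced CDS, assuming its last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks should agree with the values listed in the record.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```

Under these assumptions both comparisons hold for this record: the span covers 1425 bp, and 475 codons minus the stop codon give 474 residues.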
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0000407687 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 0 |
Fosmid unclonability p-value | 0.0000107528 |
Fosmid Hitchhiker | No |
Fosmid clonability | unclonable |
| |
Sequence |
Gene sequence | GTGTATTCCA TGATTACGCG GCTCACAACA ACCATTTCAC AACGCTCGAT GGTGATCATT GGTATCACGA TTACGATTGT GATGACCCTA AGCCTTGCGA TAGGTGTTAT GGCTGGATTG GGCCATAATT ATCGTGTGCT CATAATTGGA AGCTTACCGA TCATTGTAGC GGCAATAATC TTTGTGCTTC GGCGTTTCGA TATAGCTGTC CTGTCGATTC CACTGACGGC ATGGATTGCA GTTTATGATA TACCTGCGGG CAACTATTCA AGAGTTCCCG TCTCTCTTAT CGCTACCCTT GGTCTCTGCG CGATCTGGAT AACGTCAATG ATTATTCGTC ATCAGTGGCG ACTGGCATCG ACGCCGCTCA ATCGACCATT GATCGTTTTC GGGATCATCT GTATTGTCTC ATTGGTGTGG GGTATTATCT GGCGCGATCC TATTCTGCTT ATGGAACGTG TTGGCGGTGA TCGATTTCTA ATCGTTCAAT TTGCATCTTT GCTTTCATTT CTAGGATCTA TTGGATCAGC TCTGTTGATC GGTAACTTTG TCCAGACGAG CGGGCGTCTG AAGTTTATCT TGGGATTGTT TCTTGTCCTT GGTGGAATCT CGACGATGTT GTTAATATTC AAGATATCTC CTTTTCCCTT AAACCCACAC GGCTTGGGTG GTTTATGGTC GACTATATCG GCTTATGCTC TGGCTCTATT CCACCCAAGA CTTCGTTTAC GGTGGCGAAT TCTTCTGATT GGATTGGTTT TGATACAAAT GTACTTAGCA ATTGTCATTA ATATAACCTG GAAGTCAGGA TGGGTTCCGA CAACTATTGG TTTATTCATC GTGACATGGT TGCGTTCAAG GCACGCATTT TTTATTCTAC TAATGACTGT AATTATTGCT CTCGTTCTCA ACAGTGAACT GGTGATGCAG GTTGTTAATG ATGAACTTGA AGAGGGTAGT GATGGGCGAA TTAGCATGTG GGAGATAAAT TTACGTGTAG TTGGTGATCA CTGGTTGTTC GGTACCGGTC CAGGGGGATA TGCAGCATAT TACATGACGT ACTTTCCGCA TGATGCTCGT TCCACTCATA ACAATCATTT AGATATTATC GCTCAGTTCG GTATTTTAGG TGCCATCGTG TGGTTTTGGT TGAGTTTGGC AGGTCTTCGC GAGGGTTGGC GTCTTATCAA ACAAACTCCT CCTGGATTGC TCCATGTTCT CGCAGTGACG ATCACTACCG GGTGGATCTG CGCTCAGGTT GCGATGTTTT TCGGCGATTG GGTGCTGCCG TTCGTCTATA ATCAAACCAT TACAGGCTTC AAATACACAG TGTACACATG GATATGGTTA GGAATGTTGA TCAGCATACG AACGATCCTG ACCAGGCAGC AAGCACAGGA GAACGTCTCG AATACCCAGA GATGA
|
Protein sequence | MYSMITRLTT TISQRSMVII GITITIVMTL SLAIGVMAGL GHNYRVLIIG SLPIIVAAII FVLRRFDIAV LSIPLTAWIA VYDIPAGNYS RVPVSLIATL GLCAIWITSM IIRHQWRLAS TPLNRPLIVF GIICIVSLVW GIIWRDPILL MERVGGDRFL IVQFASLLSF LGSIGSALLI GNFVQTSGRL KFILGLFLVL GGISTMLLIF KISPFPLNPH GLGGLWSTIS AYALALFHPR LRLRWRILLI GLVLIQMYLA IVINITWKSG WVPTTIGLFI VTWLRSRHAF FILLMTVIIA LVLNSELVMQ VVNDELEEGS DGRISMWEIN LRVVGDHWLF GTGPGGYAAY YMTYFPHDAR STHNNHLDII AQFGILGAIV WFWLSLAGLR EGWRLIKQTP PGLLHVLAVT ITTGWICAQV AMFFGDWVLP FVYNQTITGF KYTVYTWIWL GMLISIRTIL TRQQAQENVS NTQR