Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RoseRS_1657 |
Symbol | |
ID | 5208614 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | + |
Start bp | 2034406 |
End bp | 2035971 |
Gene Length | 1566 bp |
Protein Length | 521 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640595265 |
Product | phage tail sheath protein |
Protein accession | YP_001275999 |
Protein GI | 148655794 |
COG category | [R] General function prediction only |
COG ID | [COG3497] Phage tail sheath protein FI |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.713042 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.0422731 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTGAGT ATCTCTCGCC TGGCGTCTAC ATTGAAGAGG TCAGCAGCGG TCCCCGCCCG ATCGAAGGCG TCGGTACGGC AATGGCGGCT TTTGTCGGGT TTGCTGCCGC CGGTCCTGTC AATCAACCGG TGTTAGTCAC AAGCTGGACG CAGTATGTTG AGAAGTTCGG TCGGCTTGAC GAGAGCGGAC GGCGGAATCC GCATATGGAT GGCGCCTATC TGTCGCACGC CGTGTACGGC TATTTCCTCA ACGGCGGCGG TCGCTGCTAC GTGACCCGCA TTCCGCAGCA GTCGGACGGC AAAGCGCCGC AGATGCCGCG TCTCGAACTT CCCACACGCG CGTCGAAAGC GTTGACATCG CTGATTGTGA CGCCCAAGAG CGAGACCGCC AGCGACATCC AGGTCGAAAT CGGTCCGCCG GTTGGCGAAA ATCCGCCGCC TGAGGCGTTC ACCGTCAAGA TCAGCATGGG AGAGATCAAA GAGGTCTACG AGAACGTTTC GTTCAACAAA CGACCCAAAG ATGGCACGTC CTACGTCGTC GAGAAGATCA ACAGTTCCAG CACGCTGGTG CAGGTTGCCG AAGGTCCGGC AACCGGTTCG CTGGCGGATC GCGTGCCGGA GTTTGGCATG TCGGTCATCA AGCCGCCGGC GCCGCTGGCG CCAGCGCGAG TGGATGCCAC GTCATTCGTC GGCAGTGCAG CCGAGCGCAG CGGTGTCGAA GGTCTGGAGA TCGCCGAGGA TGTGACGATG ATCTGCGCGC CAGACCTGAT GTCCGCTTAT CAGTCAGGCG CGATCACGAA GGAAGGGGTC AAAGCGGTTC AACTGGCGAT GATTGCCCAT GCCGAGCGCA TGCAGGATCG CATGGTCATT CTCGATCCAC TTCCCGGTCT GACGCCGCAG CAGGTCAAGC AGTGGCGTGA GCGCGACACG AACTATGACT CGAAGTTCGC TGTGCTCTAC TACCCGTGGA TCAAGATCAT GGGACCTGAT GGTAAGACCG AGATGGAAAT TCCGCCATGC GGTCATATTG CAGGCATCTG GGCGCGCAAC GATAATACGC GCGGCGTCCA CAAGGCGCCA GCGAACGAAG TGGTGCAGGG TGCGTTGGGT CCGGCAATCG CCATCACCAA GGGTGAGCAG GATGTGCTCA ACCCGATCGG CGTCAACTGC ATCCGCTCAT TCACCGGCAT GGGGTTGCGG GTCTGGGGTG CGCGCACCCT TTCGAGCGAC GCCGCCTGGC GCTATGTCAA TGTGCGTCGC CTGTTCAACT ACGTCGAGAA GTCAATCGAA CGCGGCACAC AGTGGGTGGT CTTCGAGCCG AACGATCCGA ACCTGTGGGC GCGGGTCAAG CGGGATGTCG AAGCGTTCCT GACCGTCTGC TGGCGTGATG GCATGCTGTT CGGTCTGACG CCGCGCGAAG CGTTCTACGT CAAGTGCGAC GAAGAACTGA ACCCGCCCGA AGTGCGCGAT CAGGGCAAAC TGATCATCGA AGTCGGGCTG GCGCCGGTCA AGCCCGCCGA GTTCGTGATC TTCCGCTTCA GCCAGTTCGC TGGCGGCGGA GCATAA
|
Protein sequence | MPEYLSPGVY IEEVSSGPRP IEGVGTAMAA FVGFAAAGPV NQPVLVTSWT QYVEKFGRLD ESGRRNPHMD GAYLSHAVYG YFLNGGGRCY VTRIPQQSDG KAPQMPRLEL PTRASKALTS LIVTPKSETA SDIQVEIGPP VGENPPPEAF TVKISMGEIK EVYENVSFNK RPKDGTSYVV EKINSSSTLV QVAEGPATGS LADRVPEFGM SVIKPPAPLA PARVDATSFV GSAAERSGVE GLEIAEDVTM ICAPDLMSAY QSGAITKEGV KAVQLAMIAH AERMQDRMVI LDPLPGLTPQ QVKQWRERDT NYDSKFAVLY YPWIKIMGPD GKTEMEIPPC GHIAGIWARN DNTRGVHKAP ANEVVQGALG PAIAITKGEQ DVLNPIGVNC IRSFTGMGLR VWGARTLSSD AAWRYVNVRR LFNYVEKSIE RGTQWVVFEP NDPNLWARVK RDVEAFLTVC WRDGMLFGLT PREAFYVKCD EELNPPEVRD QGKLIIEVGL APVKPAEFVI FRFSQFAGGG A
|
| |