Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rcas_3898 |
Symbol | |
ID | 5541404 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | + |
Start bp | 5101620 |
End bp | 5103023 |
Gene Length | 1404 bp |
Protein Length | 467 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 640896009 |
Product | hypothetical protein |
Protein accession | YP_001433952 |
Protein GI | 156743823 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.000694886 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTATCCGG TCTGGGAAGG TTTCTTCATC CACTCGGCAA TGTTCGTTGC CATCGTGTCC GCTTTCCACG TGCTCGCTTC GCACCTGACG GTCGCAGCCG CGTGGTTCAA CCTGTACCTG GAGCGGCGCG CAGTATACGA GAATCGCCCG GAACTATACG TCTATCTCAA GCGAAGCGCC CTGGGATTGC TGGTGTTCGC GTATGTCTTC GGAGCGATGG CCGGAGTGGG CATCTGGCAA ACAACCACCG CCGCTAACCC GCGCGGCATT TCTACCCTCA TCCACAACTT CGTGCTCTAC TGGGGATCAG AGTGGTACAT GTTCCTGATC GACGTGGTAG GAATCATTGC GTATTACTAC ACGTTCGAGC GCGTCAGCCC GAAGACGCAC CTGCGTCTGG CATGGATCCT GGCATTGGGA GGCACCGGTA CACTGACAAT CATCGTTGGC ATTCTGTCGT TCAAATTGAC GCCGGGCCTC TGGTTCGAAA CGGGGGCGAG TCTGAACGGA TTCTTCAATC CCACGTTCTG GCCCCAACTC TTCATGCGAT TTGCCCTGAT GTTTACCATC ACGGCAGCCT GGGCGCTTTT GATTGTCACC GGACTGCCGA ACGGGTACTT TGCGCGTGAG CGCATTATTC GCATTGCGGC AGTCATGGGA CTGGGCGGGT TGATCGTTGC GCTGGGCATC TGGTTCTTCT GGTACGACCC CACTCTGCCG GCTCACGCGA AGACCATTCT GCGCTCGCCT GCCATTCCAC CGATCACCTT CACGGTCATC ATCGGCGGTC TGATTGCGAC ATTCCTGGGG CTTCTGTTTG CGCTGGTCAT GCCGCGCCGT CAGCATCAGA TTATCGCCCT GGGCGCAATG CTGGTGTTGT TTGCAGCCAT CTTCGGCGCC GAACGCACTC GTGAGGTCAT TCGCAAGCCC GACATTATTG CCGGCTATAT GTCGTCGAAT CAACTGGTAT TCAACGATCT TCCCGCTCGC GGCATTCAGC GCGAAGAGCA ACCGTTGAAC GAAACCGGCA TGCTCGGCGC ACTGCCGTTT CTGCCGCGCC CGGATCAGAT TTCAGTCGCA GCAACGGGCG CTTCCAGCCA TCAGGTTGCT ATGGGACGGG TGCTGGTCAT TCAACAGTGC GCGGCTTGCC ACAATGTCAG CAACCAGACG GCGATCACCG TCTTCGATCA GCGGCTGGCG TTGCGTTCGC TGGCGCAGTT GCTCGAACGA CGCAAAATGA CGACCGCGCC AAAAATTGAG ACCTACCTGA ACGGCATTGG GGCGTTCCCA TATATGCATC CCGTCGTCGG CACGCCCGAA GAGCGGGCGG CGATGGCACT ATACCTGGAG TATTTCTTGC AACAACAGCA CGCACCACAG TCACAGGCGC AAGCCAGGAG GTGA
|
Protein sequence | MYPVWEGFFI HSAMFVAIVS AFHVLASHLT VAAAWFNLYL ERRAVYENRP ELYVYLKRSA LGLLVFAYVF GAMAGVGIWQ TTTAANPRGI STLIHNFVLY WGSEWYMFLI DVVGIIAYYY TFERVSPKTH LRLAWILALG GTGTLTIIVG ILSFKLTPGL WFETGASLNG FFNPTFWPQL FMRFALMFTI TAAWALLIVT GLPNGYFARE RIIRIAAVMG LGGLIVALGI WFFWYDPTLP AHAKTILRSP AIPPITFTVI IGGLIATFLG LLFALVMPRR QHQIIALGAM LVLFAAIFGA ERTREVIRKP DIIAGYMSSN QLVFNDLPAR GIQREEQPLN ETGMLGALPF LPRPDQISVA ATGASSHQVA MGRVLVIQQC AACHNVSNQT AITVFDQRLA LRSLAQLLER RKMTTAPKIE TYLNGIGAFP YMHPVVGTPE ERAAMALYLE YFLQQQHAPQ SQAQARR
|
| |