Gene Information |
Locus tag | RoseRS_1903 |
Symbol | |
ID | 5208864 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | - |
Start bp | 2361412 |
End bp | 2362713 |
Gene Length | 1302 bp |
Protein Length | 433 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640595512 |
Product | PUCC protein |
Protein accession | YP_001276242 |
Protein GI | 148656037 |
COG category | |
COG ID | |
TIGRFAM ID | |
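The coordinate and length fields above can be cross-checked against each other. The sketch below assumes 1-based, inclusive coordinates and that the gene length includes one terminal stop codon that is not counted in the protein length. Both are assumptions, not statements from the record itself, and the function names are illustrative.

```python
# Values copied from the Gene Information table.
START_BP = 2361412
END_BP = 2362713
GENE_LENGTH_BP = 1302
PROTEIN_LENGTH_AA = 433


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residues in an unspliced CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for the values in this record.
    print(gene_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 2362713 - 2361412 + 1 = 1302 bp and 1302 / 3 - 1 = 433 aa, which matches the listed values.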
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.00453505 |
Fosmid hitchhiking | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACGCTGT TCAAGAACAT TCGTCTGGGA TTGCTGCACG TTGCGATTGC GATGACGTTC GTGCTGATCA ACAGCGTGCT GAACCGGATC ATGATCCACG ATCTGAACAT TCTGGCGAGC ATTGTTGCCG TGCTGGTCGT GTTGCCCTAC GTGCTGTCGC CAGCGCAGGT CTGGATCGGG CAATACTCCG ACACCCACCC GATCTTCGGG TACCGACGCA CGCCGTATAT CGCACTGGGG ACATTGCTCG CCATCACAGG CGCTGCCCTG GCGCCGCACG CCGCGCTGGC GCTGGCGCGG GAGCCGCTGA TCGGCGTGCC GCTGGCGGTT CTGCTGTTTG GGATGTGGGG CGTTGGATAC AATCTCGCAG TGGTCGCCTA CCTGTCGCTC GCCAGCGATA TGTCCACTGA GCAGCAGCGT TCGCGGACTG TTGCGATCAT GTGGTTCATG ATGATCACCA GCGTCATAGT GACGGCGATT GTCGTTGGGC GTGCGCTGGA GCCGTACAGC GAAGAGCGTC TCTTCACCGT CTTTCTGGAG ACAGGCGGCG TGGCGCTGGC GCTGGCGCTT GTGGGGTTGA TCGGTCTCGA GCCGCGCCGC ACAACGGCGA CCGTGCAGCA GAGCCGCGCC GGGCAGATGG CAGCCATCCG CGCCATTATC GGCAATCCGC AGGCACGTTT CTTTTTCGTC TATCTCATCA TGCTGCTGGC GGCGATCCTG GGGCAGGATG TTCTGCTCGA GCCGTTTGGC GCGCAGGCAT TCGGAATGAA TGTCAAAGAA ACGACGCAAC TGACCGCGAT GTGGGGCGGC GCCACATTGA CGGCATTACT GCTGTACGGT GCGGTGCTCA GTCGCTGGAT CAGCAAGAAG CGCGGCGCGA TGATCGGCGG TTCGATTGCC GCAACCGGCT TCCTGCTGAT TGCGCTGAGC GGCATGCTCG CCATCGAAGC CATGTTCATC CCTGGAATCC TGCTCCTTGG TTTCGGCACC GGCATTGCCA CCACGACCAA CCTGGCGCTG ATGCTCGATA TGACAACAGC CGAGCAGGTC GGCTTGTTCA TCGGTGCGTG GGGTGTGGCA GATGCAATCG CCCGTGGCGT CGGCACGTTG CTTGGCGGCG TGATGCGCGA TGTCATTGCC CATATGAGCG GCAGCGCCGT CAGCGGCTAT GTCAGCGTCT TCCTGATCGA GGCAATGCTG CTGGGCATTT CTCTGGTATT ATTACAGCGA ATCGATGTGA CCGCCTTCCG CAGCCGCCAA CCGTCGCTGA CCGAACTGGT TGCGATCACT GGCGATGCCT GA
|
Protein sequence | MTLFKNIRLG LLHVAIAMTF VLINSVLNRI MIHDLNILAS IVAVLVVLPY VLSPAQVWIG QYSDTHPIFG YRRTPYIALG TLLAITGAAL APHAALALAR EPLIGVPLAV LLFGMWGVGY NLAVVAYLSL ASDMSTEQQR SRTVAIMWFM MITSVIVTAI VVGRALEPYS EERLFTVFLE TGGVALALAL VGLIGLEPRR TTATVQQSRA GQMAAIRAII GNPQARFFFV YLIMLLAAIL GQDVLLEPFG AQAFGMNVKE TTQLTAMWGG ATLTALLLYG AVLSRWISKK RGAMIGGSIA ATGFLLIALS GMLAIEAMFI PGILLLGFGT GIATTTNLAL MLDMTTAEQV GLFIGAWGVA DAIARGVGTL LGGVMRDVIA HMSGSAVSGY VSVFLIEAML LGISLVLLQR IDVTAFRSRQ PSLTELVAIT GDA
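Both sequences are displayed in space-separated blocks of 10 characters, so the whitespace has to be removed before they are used in any analysis. The following is a minimal sketch of that cleanup, plus a simple GC-fraction helper. The function names are hypothetical, and the example strings are only the first blocks of the gene sequence shown above.

```python
def strip_blocks(display_sequence: str) -> str:
    """Remove the whitespace used to group residues into display blocks."""
    return "".join(display_sequence.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return gc_count / len(dna)


if __name__ == "__main__":
    # First two display blocks of the gene sequence.
    prefix = strip_blocks("ATGACGCTGT TCAAGAACAT")
    print(prefix)
    print(len(prefix))
    print(gc_fraction(strip_blocks("ATGACGCTGT")))
```

Applied to the full gene sequence, `strip_blocks` should yield a string whose length matches the listed gene length, and `gc_fraction` should give a value comparable to the listed GC content. Neither result is verified here. The displayed gene sequence starts with ATG and ends with TGA, which suggests it is given in coding orientation even though the gene is on the minus strand.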