Gene Information |
Locus tag | Rcas_1543 |
Symbol | |
ID | 5539019 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | + |
Start bp | 1969932 |
End bp | 1971233 |
Gene Length | 1302 bp |
Protein Length | 433 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640893681 |
Product | PUCC protein |
Protein accession | YP_001431654 |
Protein GI | 156741525 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
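The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the gene length includes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1969932
end_bp = 1971233

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon, which is not counted in the protein.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")     # compare with the Gene Length field
print(protein_length, "aa")  # compare with the Protein Length field
```

Under these assumptions the computed values agree with the listed Gene Length (1302 bp) and Protein Length (433 aa).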
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.000848404 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACGCTGA TCAAGAACAT TCGCCTGGGG TTGCTGCACG TGGCGATTGC TATGACCTTC GTGCTGATCA ATAGCGTGCT GAACCGGATT ATGATCCACG ATCTCGGCAT TCTGGCGAGC GTTGTCGCTG TGCTGGTGGT GCTGCCGTAT ATCCTCTCGC CAGCGCAGGT CTGGATCGGG CAATATTCCG ATACCCATCC GATGTTTGGG TACCGGCGCA CACCGTATAT CGCGTTGGGC ACGCTGCTCG CGCTGGCCGG CGCAGCGCTG GCGCCGCACG CAGCCCTGGC GCTGGTGCGT GATCCGCTGA TCGGTGTACC ACTGGCGATT CTGCTCTTCG GGATGTGGGG TGTCGGGTAT AACCTGGCGG TCGTCGCATA CCTGGCGCTC GCCAGCGATA TGTCTACCGA GCAGCAGCGT TCACGCACAG TGGCGATCAT GTGGTTCATG ATGATTGCCA GCGTCATTGT GACTGCGATT GTCGTCGGGC GCGCGCTGGA GCCGTACAGT GAAGAGCGCC TCTTTACCGT CTTTCTGGAG ACTGGCGGCG TGGCGCTGGC ATTGGCGCTC GTGGGGTTGA TCGGTCTCGA ACCGCGGCGC GAACCTATTG CTGTGCAGCA GAGTCGCGCC GGACAGGTGG CGGCTATTCG CGCCGTACTC GACAATCCAC AGGCGCGCAT CTTTTTCGTC TACCTGATCA TGATGCTGGC GGCGATCCTG GGTCAGGATG TGCTGCTGGA GCCATTTGGC GCACAGGCGT TCGGAATGAA TGTCAAGGAA ACCACCCAAT TGACCGCAAT GTGGGGCGGC GCAACACTCT CGGCGCTGCT GCTGTATGGC GCTGTGCTCA GCCGCTGGAT GAGCAAGAAG CGCGGCGCGA TGATCGGCGG CTCGATTGCC GCGACCGGCT TTCTGCTGAT TGCCCTCAGC GGCATGCTCG CTATCGAAGC CATGTTCCTT CCCGGCATCG TGCTGCTTGG CTTTGGTACC GGCATTGCCA CAACCACCAA CCTGGCGCTC ATGCTCGACA TGACGACGGC TGAACAGGTC GGATTGTTTA TCGGTGCGTG GGGCGTAGCG GATGCATTGG CACGCGGGGT GGGCACGCTC CTTGGCGGCG TCATGCGCGA TGTGATTGCG CACATGAGCG GAAGCGCCGT CAGCGGTTAT GTCAGCGTGT TCCTGATCGA GGCGTTACTG TTAGGCATTT CTCTGGTATT ATTACAGCGC ATCGACGTAA CCGCCTTCCG CAGCCGCCAG CCGTCGCTGA CCGAACTGGT TGCGCTCTCT GGCGACGCCT GA
|
Protein sequence | MTLIKNIRLG LLHVAIAMTF VLINSVLNRI MIHDLGILAS VVAVLVVLPY ILSPAQVWIG QYSDTHPMFG YRRTPYIALG TLLALAGAAL APHAALALVR DPLIGVPLAI LLFGMWGVGY NLAVVAYLAL ASDMSTEQQR SRTVAIMWFM MIASVIVTAI VVGRALEPYS EERLFTVFLE TGGVALALAL VGLIGLEPRR EPIAVQQSRA GQVAAIRAVL DNPQARIFFV YLIMMLAAIL GQDVLLEPFG AQAFGMNVKE TTQLTAMWGG ATLSALLLYG AVLSRWMSKK RGAMIGGSIA ATGFLLIALS GMLAIEAMFL PGIVLLGFGT GIATTTNLAL MLDMTTAEQV GLFIGAWGVA DALARGVGTL LGGVMRDVIA HMSGSAVSGY VSVFLIEALL LGISLVLLQR IDVTAFRSRQ PSLTELVALS GDA
|
| |
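The gene and protein sequences above can be spot-checked against each other by translating a few codons. The following is a minimal sketch, not the database's own pipeline: it builds a codon table by hand (translation table 11 uses the same amino acid assignments as the standard code and differs only in permitted start codons) and translates the first 30 and last 12 bases of the gene sequence. The `translate` helper and variable names are hypothetical.

```python
from itertools import product

# Codon table in TCAG order; amino acid assignments shared by the
# standard code and translation table 11.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon; spaces are ignored, '*' marks a stop."""
    dna = dna.replace(" ", "").upper()
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3))


# First three and last two blocks of the Gene sequence field, copied verbatim.
head = translate("ATGACGCTGA TCAAGAACAT TCGCCTGGGG")
tail = translate("GGCGACGCCT GA")

print(head)  # compare with the start of the Protein sequence field
print(tail)  # compare with the end of the Protein sequence field, plus the stop
```

The translated head matches the first block of the protein sequence (MTLIKNIRLG), and the tail gives GDA followed by a stop, matching the final residues listed.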