Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rcas_3774 |
Symbol | |
ID | 5541276 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | - |
Start bp | 4946834 |
End bp | 4947787 |
Gene Length | 954 bp |
Protein Length | 317 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640895884 |
Product | UspA domain-containing protein |
Protein accession | YP_001433831 |
Protein GI | 156743702 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG0589] Universal stress protein UspA and related nucleotide-binding proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.143188 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.471237 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAAACCA TAGTTGTACC GCTCGATGGC TCCGAACTGT CAGCCCAGGT GCTGCCATAT GTTCGGGTGT TGGCGCCGCT GTTGGGGGCG CGGGTCCATC TGCTCCGCGT CGTTCAGGAG TCGCATCTTA GCAGCGGCGA GTCGTGGGCG CAGGTGCTCA TTTCGGTCTA TGGCGTTCCA GAGAGCGAAG CGCTGCGCCG CGATTATTAC GATGAATCGA TGGAGGCGCT TCGCCAGCGC GCGCAGGCGT ACCTCGACGC ACAGGCGAAG GCGCTCGAAG ATGCAGGGAT TGAAGTCGTC AGCGATGTGC GCTCCGGTTC ACCTGCCGAT GTGATCGTCG AAGTCGCCGC CGGATGCCTC CCCGGTGCAA TGATCGCAAT GGCCACGCAT GGCTACAGCG GCTTGCGCCG CTGGGCGCTT GGCAGTGTCG CCGATAAGGT CATTCATGCG ACCGAAGCGC CGGTGCTGCT GGTGCGCGGG CAAGCGCAGC CGATTGTGCA TCCGCCGCGC CGTATTCTCA TACCGCTCGA TGGCTCCGGG CTGGCGCGGC AGGCGCTGCC GCTTGCGAGC GAGATTGCGC GCGCCGCACA CGCCGAATTG ATCCTGCTCC GCGCGGTGGT GCCCATGATC GAGGCATACA TCGGTGCGCC CATGTTGGGT CGTCCTCTTG CCGAGAATAA CGAAGCGCTT GGCGCGCTGC ACGAGTACGC GCTGAACGAC CTGAACGCGG AAGCCGCCTC ATTGCGCGCT GAGGTTCCGC GTGTGCTTAC CCATGCGATT ATCGGGTATC CTGCCGAGGT GATCATCGAC GAAGCGCAGG CGATGGACGT CGACCTGATT GTGATGGCGA CGCACGGATA TGGCGGGTTG CGGCGCTGGG CGCTCGGCAG CGTGGCGGAT AAGGTGCTGC ACGCCACGAC GACGCCGCTC ATTCTGGTGC GTGCCGGCGA GTGA
|
Protein sequence | MQTIVVPLDG SELSAQVLPY VRVLAPLLGA RVHLLRVVQE SHLSSGESWA QVLISVYGVP ESEALRRDYY DESMEALRQR AQAYLDAQAK ALEDAGIEVV SDVRSGSPAD VIVEVAAGCL PGAMIAMATH GYSGLRRWAL GSVADKVIHA TEAPVLLVRG QAQPIVHPPR RILIPLDGSG LARQALPLAS EIARAAHAEL ILLRAVVPMI EAYIGAPMLG RPLAENNEAL GALHEYALND LNAEAASLRA EVPRVLTHAI IGYPAEVIID EAQAMDVDLI VMATHGYGGL RRWALGSVAD KVLHATTTPL ILVRAGE
|
| |