Gene Information |
Locus tag | RoseRS_4046 |
Symbol | |
ID | 5211029 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | + |
Start bp | 5068776 |
End bp | 5069822 |
Gene Length | 1047 bp |
Protein Length | 348 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640597634 |
Product | hypothetical protein |
Protein accession | YP_001278340 |
Protein GI | 148658135 |
COG category | [C] Energy production and conversion |
COG ID | [COG1592] Rubrerythrin |
TIGRFAM ID | |
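The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. Neither assumption is stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 5068776
end_bp = 5069822

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and encodes no residue.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)     # 1047
print(protein_length)  # 348
```

Under these assumptions the computed values agree with the listed Gene Length (1047 bp) and Protein Length (348 aa).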
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGATATTG TTGAGTTTTC ACCAGACATA CGGATTGGTG CTATGAGCGA CACTCCCAAT GATGCTGCCC GGATGTTCGC TCTTGCGAGC ATGGGACATG CCGCATACCA TCTCTGGGCG GAACAGGCGC GTCGGGATCG CTGTTTCAAC ATTGCGCGTC TGTTCGAGGC GTTGAGCGCT GCGCGTCTGG CGCGCGCCGG GAACGCCTTC CGCCGTCTGG GGCTTGTGCG TTCGACGGCG GAGAATGTTG CCAGCGCTTT TTCCGGTGCA GGCATCGGCG ACATTCCTGC CGACCGGATC ACCGGCGTGA CGCCGTTTGC GCGGGAACTG CTGGCGCGGG CGCAGCGCGC CGTGGCTGAA GGGCGCGATC TGCGCGCCGG CGAACTGGGT GATCTCTTCG TCTGCACCAC GTGCGGCGAG ATCCGCGAAG GTGCGCTCGA AGGCGCGTGT CCGCGCTGTG GCACAGTTCC TGAAGCGCAC AAAGCGTTCC GCGCCATCGA AGCAATGGGA ACGCTTGGTC CGCACGCAAT TATGACCTTT CTGGAACATA CGGAGGAGGC GATCCGAACG CTGGTGGCAG GGCTGGACGA GGAGATGCTC TCCCGGCGCC TGAATGAAAC CACACCGTCG TTGAAAGAGG TGATCGGGCA TCTTGCCGAT ATGGACGCAA TCTTTCGTCA GCGCGCCTGG TTGCTGCTCG AGACCGTGCG ACCGGTTCTT CCGCCAGCGC ATCCTCCAAC CCTGGAATCG GCGGATGTGT ATCGTGACCA ACCGATTGAC CGGGTGATGG AAGCCTATCA CGCAACGCGG GCGCAAACCC TGAACCTGCT GCGCGGATTG ACCAGCGCGG CGTGGCATCG GGAAGGGGAC CACGAGGTGT ATGGAGTGAT CAATCTGTTG CATCAGGCGA ACTGGCTTAT ATCGCACGAA CGTGCGTATC TCGTTGAAAT GGCGCAGATC CGTCATAACC TGATCGCCGC CGATCGGCGC TATTGCGAAG CGGAAGTGAC CGATATCGTT GTGACCGGCT CGCACGAAGG AGAGTGA
Protein sequence | MDIVEFSPDI RIGAMSDTPN DAARMFALAS MGHAAYHLWA EQARRDRCFN IARLFEALSA ARLARAGNAF RRLGLVRSTA ENVASAFSGA GIGDIPADRI TGVTPFAREL LARAQRAVAE GRDLRAGELG DLFVCTTCGE IREGALEGAC PRCGTVPEAH KAFRAIEAMG TLGPHAIMTF LEHTEEAIRT LVAGLDEEML SRRLNETTPS LKEVIGHLAD MDAIFRQRAW LLLETVRPVL PPAHPPTLES ADVYRDQPID RVMEAYHATR AQTLNLLRGL TSAAWHREGD HEVYGVINLL HQANWLISHE RAYLVEMAQI RHNLIAADRR YCEAEVTDIV VTGSHEGE
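The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any computation. The following is a minimal sketch of how that might be done, with a GC percentage calculation as an example. The helper names `clean_sequence` and `gc_percent` are hypothetical and are not part of any database tool, and only the first two blocks of the gene sequence are used here for illustration.

```python
def clean_sequence(blocked: str) -> str:
    """Join whitespace-separated blocks into one uppercase string."""
    return "".join(blocked.split()).upper()


def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in a DNA string."""
    seq = clean_sequence(dna)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two blocks of the gene sequence listed above (illustration only).
first_blocks = "ATGGATATTG TTGAGTTTTC"

print(len(clean_sequence(first_blocks)))  # 20
print(gc_percent(first_blocks))           # 30.0
```

Passing the full Gene sequence field to `gc_percent` would give a value that can be compared with the GC content field, which is reported as a rounded percentage.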