Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RoseRS_0031 |
Symbol | |
ID | 5206964 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | - |
Start bp | 31829 |
End bp | 33397 |
Gene Length | 1569 bp |
Protein Length | 522 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 640593665 |
Product | TPR repeat-containing protein |
Protein accession | YP_001274424 |
Protein GI | 148654219 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG5010] Flp pilus assembly protein TadD, contains TPR repeats |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.918489 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTTCGA CAGAACCGCT CAATTTCATG CCGCTCGATC TGGACACGTT CAGCGGGAGC GAGCGATTCA TGGCTGGAAC GCGCCTGGGC GCGGCGTTCG GTCAGGGTAT CCGCGCCTAC CTGCGCGCTG ACTACGCCAA TGCGATCGAG CATTTCAAAG CCGCTTTGAT CGCTGCATAC ATCGAAGGTG AAGAACGCGC TCAGATTTAT GATCGCGAAC GGGCGATCAT CTATCTGTAC ATCGGCAATG CGCTGGCGTA TCAGGAGGAT TGGGAAGGGG CGCTGCGCGA GTATCTCGAA GCCGTGCAGA CCGATCCGCA ACTGGCTGAG GCGCACTACA ACCTGGGGGT GGCATTTGCG GCGCAGGGGC GGCTCGACCG CGCGATTGCC GCATTCAAGG AAGCCATCGA ACACAATCCG CGCCTGTACG AGGCGCACTT CTCACTGGGA CGCTGCTATC AGCGCCTCGA TGATGCCGGT CGCGCGTACA TTCACTACGA TCAGGCGTGT CAGGCGCGCC CGCAGGCCGC AGAGCCGCGC TACTACATGG GATTGATGCA CCAGAGCCAC GGCGCGCACG AACTGGCGCA GCGTTGCTTT GCCGAGGCGC TGCGCGTCGA GCCGACCTTC GTCTCGCCAG AATTGCAGGA CGAGGTGCTG GTCAACCGCT CGGAAGAGGA AGTCGCCCAA TGGTACTACC GCCTCAGCAA CGATCTGAAA CAGCAGGGGT ACGAGGAAGA GGCGGAACGG ATCTACCGCG CGCTGCTCCA GTGGCGTCCC GAAGAACATT ATGCCCGTTA TCTGCTCGGC AATCTGCTGG CGCGCGCGCG GCGTCTCGAT GAAGCGCTTG AAGCGTATGC CCAGATTCCG CCACAGGACA GATATTACGT CGATGCGCGC ATTCGGATCA GCGCGATCCT CAAACTTCAG AACAAGATGC GCGAGGCGTA TGATACCCTG TTCGAGTGCG CCAAACTGCA CCCGACCAAT GGTCAGTTGT TCCTGAATAT GGGTAAGTTG CTCTACGATA TGAACAAACA CGCTGGCGCT GTCAAAGCGT TTGAGCGCGC CGTGCAACTG CTCCCCAACG ATCCGCAGGC GCACTACCTG CTGGGGTTTA TGTACAACCT CATGGGACGC GAGGGATGGG CGCTGGCAGC CTGGCGCAAG GCAGTGGAAC TCGCTCCGGA CGCGCATTCT CTGCGCTACG ACCTTGGCTA CATGTACGTG CGACGCAACC GCTATGACCT GGCAGCAAAA GAGTTTGCCC GCGTGCTCCA GTTCTGGCCC GATGATGTCG AGACGAACTT TATGCTCGGA TTGTGCTACA AAGAACTGAT GGAACCGGCG CGAGCCATTC CGCTGTTTGA AAAAGTGCTG CGTCGCAATC CGCGCCACGT GCAGGCGCTC TATTATCTCG GCGCTTCGTA CCTTCAGATT GGCAATACAT CGCTTGGCAA GGCGTATCTC AGGCGCTACG ACTACCTGGC GAGCCAGGAA CAGTCGAGTG CGCCTGTGAC GCGCCGGACC ATGCGCCAGC GCACTGTCGG CATGCTAGAG TCGTCGTAA
|
Protein sequence | MSSTEPLNFM PLDLDTFSGS ERFMAGTRLG AAFGQGIRAY LRADYANAIE HFKAALIAAY IEGEERAQIY DRERAIIYLY IGNALAYQED WEGALREYLE AVQTDPQLAE AHYNLGVAFA AQGRLDRAIA AFKEAIEHNP RLYEAHFSLG RCYQRLDDAG RAYIHYDQAC QARPQAAEPR YYMGLMHQSH GAHELAQRCF AEALRVEPTF VSPELQDEVL VNRSEEEVAQ WYYRLSNDLK QQGYEEEAER IYRALLQWRP EEHYARYLLG NLLARARRLD EALEAYAQIP PQDRYYVDAR IRISAILKLQ NKMREAYDTL FECAKLHPTN GQLFLNMGKL LYDMNKHAGA VKAFERAVQL LPNDPQAHYL LGFMYNLMGR EGWALAAWRK AVELAPDAHS LRYDLGYMYV RRNRYDLAAK EFARVLQFWP DDVETNFMLG LCYKELMEPA RAIPLFEKVL RRNPRHVQAL YYLGASYLQI GNTSLGKAYL RRYDYLASQE QSSAPVTRRT MRQRTVGMLE SS
|
| |