Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rcas_4211 |
Symbol | |
ID | 5541722 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | + |
Start bp | 5446724 |
End bp | 5448292 |
Gene Length | 1569 bp |
Protein Length | 522 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 640896318 |
Product | TPR repeat-containing protein |
Protein accession | YP_001434256 |
Protein GI | 156744127 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG5010] Flp pilus assembly protein TadD, contains TPR repeats |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.185796 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTTCGA CAGAACCGCT CAATTTCATG CCGCTCGATC TGGACACGTT CAACGGAAGC GAGCGATTCA TGGCGGGGAC GCGCCTGGGG GCGGCGTTTG GACAGGGCAT TCGCGCCTAC CTGCGCGCTG ATTACGCAAA TGCGATCGAG CACTTCAAGG CTGCGTTGAT CGCCGCCTAT ATCGAGGGAG AAGAACGTGC TCAGATCTAT GATCGCGAAC GTGCGATCAT CTATCTCTAC ATCGGTAATG CACTGGCGTA CCAGGAGGAT TGGGAAGGAG CGCTGCGCGA GTATCTGGAA GCGGTGCAGA CCGATCCGCA ACTGTCGGAG GCGCACTACA ACCTGGGCGT GGCATTTGCC GCGCAGGGGC GTCTTGATCG TGCGATTGCC GCGTTCAAGG AAGCCATCGA GCATAATCCG CGCCTGTACG AAGCGCACTT CTCGCTCGGA CGCTGCTATC AGCGTCTCGA CGACGCCGGG CGAGCGTATA TTCACTACGA CCAGGCATGT CAGGCGCGTC CTCAGGCGGC CGAGCCGCGC TACTACATGG GGTTGATGCA CCAGAGCCAC GGCGCGCACG AACTGGCGCA GCGCTGTTTC GCCGAAGCGT TGCGCGTCGA GCCAACCTTC GTCTCGCCGG AGTTGCAGGA CGAAGTGCTG GTCAATCGCT CGGAAGAAGA AGTCGCTCAG TGGTACTACC GCCTCAGCAA CGATCTGAAG CAGCAGGGGT ACGAAGAGGA GGCGGAGCGG ATCTACCGGG CATTGCTCCA GTGGCGGCCA GAAGAACACT ATGCGCGCTA TTTGCTCGGC AACCTGCTGG CGCGCGCGCG GCGGCTCGAT GAGGCGCTCG AAGAGTATGC TCAGATCCCG CCACAGGATA AATATTATGT CGATGCGCGT ATTCGTATCA GTGCTATTCT CAAGTTGCAG AACAAAACAC GCGAGGCGTA TGACACCCTC TTCGAGTGCG CCAGGCTGCA CCCTGCCAAC GGTCAGTTGT TTCTGAACAT GGGCAAGCTC CTCTACGATA TGAACAAACA TGCGGGCGCT ATCAAGGCAT TCGAGCGCGC GGTGCAGTTG CTCCCCAACG ATCCGCAGGC GCACTATCTG TTGGGGTTTA TGTACAACCT GATGGGTCGC GAGGGGTGGG CGCTGGCAGC CTGGCGCAAG GCGGTCGAAC TTGCGCCGGA CGCGCATTCG CTGCGCTACG ACCTCGGCTA TATGTATGTG CGACGCAATC GCTACGATCT GGCGGCGAAA GAGTTTGCGC GCGTGCTCCA ATTCTGGCCC GACGACGTCG AAACGAACTT TATGCTTGGG TTGTGCTACA AAGAATTGAT GGAACCGGCG CGCGCCATTC CGCTGTTCGA GAAAGTGCTG CGACGCAATC CGCGCCACGT GCAGGCGCTC TACTATCTGG GCGCATCGTA CTTGCAGATC GGCAACACCT CGCTGGGGAA AGCCTATCTC AGACGCTACG ACTACCTGGC GAGCCAGGAG CAGACAAGCC CGCCCACGAC GCGTCGCGCG ATGCGGCAAC GCAGCGTCGG GATGGTCGGT TCATCGTGA
|
Protein sequence | MSSTEPLNFM PLDLDTFNGS ERFMAGTRLG AAFGQGIRAY LRADYANAIE HFKAALIAAY IEGEERAQIY DRERAIIYLY IGNALAYQED WEGALREYLE AVQTDPQLSE AHYNLGVAFA AQGRLDRAIA AFKEAIEHNP RLYEAHFSLG RCYQRLDDAG RAYIHYDQAC QARPQAAEPR YYMGLMHQSH GAHELAQRCF AEALRVEPTF VSPELQDEVL VNRSEEEVAQ WYYRLSNDLK QQGYEEEAER IYRALLQWRP EEHYARYLLG NLLARARRLD EALEEYAQIP PQDKYYVDAR IRISAILKLQ NKTREAYDTL FECARLHPAN GQLFLNMGKL LYDMNKHAGA IKAFERAVQL LPNDPQAHYL LGFMYNLMGR EGWALAAWRK AVELAPDAHS LRYDLGYMYV RRNRYDLAAK EFARVLQFWP DDVETNFMLG LCYKELMEPA RAIPLFEKVL RRNPRHVQAL YYLGASYLQI GNTSLGKAYL RRYDYLASQE QTSPPTTRRA MRQRSVGMVG SS
|
| |