Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rcas_3987 |
Symbol | |
ID | 5541497 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | - |
Start bp | 5198892 |
End bp | 5202170 |
Gene Length | 3279 bp |
Protein Length | 1092 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 640896099 |
Product | TPR repeat-containing protein |
Protein accession | YP_001434038 |
Protein GI | 156743909 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG5010] Flp pilus assembly protein TadD, contains TPR repeats |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.017736 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCAGGCA ACCGTGCATT GTTCGACCGC GCGATGGAGC AAAGCCGCGA AGCAGCGCGG CTTATGAACT GGGACGAGGC GCTGAAACAG GCAGCCCGCG CCTTGCAAGA ATTCCCTCAG GACCTGGATG CGCGCTTGGC TGTCGCTGTT GCGTTCTTCA ACACCGCAAA GTATGCGCAG GCGCTTCAGA TTTTCGATGA ATTGCGTCGC ATCGATGCCG GCAATCCATT CTATCTGGAA TATCTGGCGC GTACCCACGA GCGTCTCGGC GACGTGCAGG CGGCAACCAA CGCCTATGTG CAACTTGCCG ATCTTCAGAT GAACCGCAAA CTGGCTGCAC GCGCCATCGA TGCCTTACGC GAGGCGCTAC GCCTGCAATC CGACGCCGAC GATCAGCGGC TGCGTCTGGC GCAGTTGCTC GCCGATCAGG GCGCGAGTGC AGAGGCGGCA ACGCACTATC TCGAACTGGC ACGGCGCGCT CAAGCGCGTG GGAGCCTCGA ACAGGCGGCG GAACTGGCGG AAATGGCGTT GCGCTGCGTA CCCGACAATC GTGAAGCCAA AGAGTTAATC GCGGCGCTCC ACGATGCCCT GGCACAGACG GTCCAATCAG CAATCGAAGC GACCGCAACC GCCGCCGATG CAACCGTATT GCCCATTATC GGCACCGGCG GTTTGCGCGG TGCGCACCTC GCCATCGAAC GAATCATCGC GCTGGCGCAC GAACGTCAGG AGGCCGGCGA TATTGATGGC GCCATCGAGC AGTATGAACG CGCGCTCAAA CTTGGAACCG ACCGCAGCGA TGTGTTTTAT AGCCTGGGCT TGCTGTATCA GGAACGCGGC GACTATCAAC GCGCGGTGGA ACTGTTGCAG GGCGCCGCTG GCGACCCGGA ATACGCCCTT TCAGCGCATT ATATGCTTGG TCAGGCGTAT CAGGCGCTGG GCCAACTGCC CGAAGCAGCG CACGAATACG AACAAACTAT TCGTCTGTTG CCGCTGGAAT CGATTGGACG CGCCGAAGCC GACGATATGA TCCAGATGTA CGAGAGCGCC GCGCAGATCT ATATTCAACT CAACGACATT GCGCGCGCTG CAACGCTCTA TTCGACGCTG GCAAACTTTC TTCAGGGCAA ACGCTGGGGA CGCGAGCGCG CCGATGAGTT CCGCCAGAAA GCCAAAGACC TGACCGAGCG CAATATGTTT GCCAAACTCC GCACGCTCGG CACGGGCGCA CTGACATCAC CGCCATCCGC CCCTGAACCC GAAACGCCTC CCGAAAGTCC GATGCCCGAA ACCTGGGGGA AAATTCGCCC TATTACCGAT TTTCTGCGAG CGCCTGAGCC ACAGACCCCC GACCATCATC GCTTTGAACC ATCCCCATCA GTCGCCGCCG AACCGGTCGA CCCGCTGGCA ATCCTCGAAA CGCTGCCGCA TCCCGAACAC GTTCCCGTTG CACCGGTAAC ACCGCTCGAC ACGACCGGAC TGGATGAAAT CGGTGAACGG TATGTGCTGG CGAGCGAAAA GTATGTCGAA CAGGGATTGA TGCTGGCAGC CAACGACGCA TGTATGGAGG TTATCAGGCT GAACCCGGAG TATCTGCCGA TTCACCTGCG ATTGGGAGAG ATTTACGAGC GCGATGGACG CAAAGACGAA GCATTAATCA AGTATCAACT GTTGATCGAC ACGTATGTGG CGCGGGGAGA ACCCCGGCGC GCCATCGACG TTTACTACCG GCTGATCGAA CTGTCGCCCG ATACAATTAT GCCGCGTTCG CGGCTGGCTG AGTTGCTGCG CGCTGACGGA CGAAACGAAG AGGCAGCGCA GCAACTTGCC ATTGTTGCCG GCGCCTATTT TCGGATGGGA CAAACGACCA AAGCGCTCGA GGAGTACCGA CGCGCCCTGC AATGGTCGCC GTCCAATGCA GAACTCCACG CACAGTATGG GCAGGCGCTG CTGAAACTCG ACCGCGCCGA GGCAGCGCTG GTCGCTTTCC GGCGCGCTCT CGAACTCGAT CAGCAGAATC CGGTTCATAT TGCACGCATC AACATGGCGC TGGCCATTAT GGGTGAGCAA CCCGTCGCCG TCTGGCAATC GCTGGCGACG CTGCTCGACC AGCTCAAACA GCATCCCCAA CGCCTGAATG AAGTACAATC CGAGTACCGC GCCACGTTAC TGGTTGCCGA TCTTCCCATA TTGCACTACA TTCTGGGCAT TGTTCAGCAG CACGCCGGGC AACATCCATC GGCGCTGCTG GAGTTCGAGC AGGCCATCGA ACTGCTGAAC GCCGAGAACG ATCCGACTCT GACGCCACAC CTGGTTTATC AGGCAATGGC GGATAGTCAC ATTGCACTGG GTCAGGCGAG TGAGGCGTTG CGCCAGCTTC AGCATGCCCT GGATCTTGCT CCTGCGCCGC CGCCGCCGGA AAACGCCCGT TATCCGTTTT CGCTCCCCCT TTCTCAGGGC GAAATCGTGC GCCGTATGGC GGAAGCATAT GCCGCTGTCG GCGATCTCGC CAGCGCCGAA CGCGCGCTTC AGGAAGCCAA ACAGTTTCTG CCCTACGACC GCGCGATCTA CACCAAACTG TCCGATATCT ATTTTCGGCA GGGTCGCCTG AATGAAGCGC TGACGCAACT CGACGAACTG GCGACCCACT ATGAGCAGCG CCAGATGCTG GATCGCGCGA TCGAGGCACT CGAAAATGCG CTGCGCCTGG CGCCCAACAA TATTCCGATC AGCCATCGCC TGGCGAAAAT GTATATCCGG CGCGGCTACC TGGATAAAGG CGTCGAAGCG CTGACGCGTG TCGCAGAACT TCAACGGAAG GAAGGGCAGA TCAAAGACGC CATTGCCAAC CTTCAGCAGG CGGCCGAGGT GCATTGGACG CTTGGCAGGC AGGAGGAGGC GCGCGCACTC TATGACAAAA TCGTGCATAT CGCACCCAAT GATATTGAAG CACGCCAGTG GCTGTCGTTT ATGTACACCC TGGCCGGCAT GACGCGTGAA GCCGTTGCGC AAAAGAAACA GATTATTCGC ATCCTGCTCC AGCGGCGCGA TCTGGATAAC GCTATTGCGG AGATGCATCA AATCTACGGA CTCGACCAGA ACGATACCGA TAATCTGTTT CAACTCGGTG ATGCGCTTAT GCGGCGGCAG GAATATGAAC AGGCAATCCG CATCTATACC AGGCTGGCAA AAGTGCCAGG CGTCGAAATT GAACGCGTCG AAGCGTTACA GGCAGCAGCC AGACGGATGC TCGAACAACA ACAGGCAGGA AACCGGTGA
|
Protein sequence | MPGNRALFDR AMEQSREAAR LMNWDEALKQ AARALQEFPQ DLDARLAVAV AFFNTAKYAQ ALQIFDELRR IDAGNPFYLE YLARTHERLG DVQAATNAYV QLADLQMNRK LAARAIDALR EALRLQSDAD DQRLRLAQLL ADQGASAEAA THYLELARRA QARGSLEQAA ELAEMALRCV PDNREAKELI AALHDALAQT VQSAIEATAT AADATVLPII GTGGLRGAHL AIERIIALAH ERQEAGDIDG AIEQYERALK LGTDRSDVFY SLGLLYQERG DYQRAVELLQ GAAGDPEYAL SAHYMLGQAY QALGQLPEAA HEYEQTIRLL PLESIGRAEA DDMIQMYESA AQIYIQLNDI ARAATLYSTL ANFLQGKRWG RERADEFRQK AKDLTERNMF AKLRTLGTGA LTSPPSAPEP ETPPESPMPE TWGKIRPITD FLRAPEPQTP DHHRFEPSPS VAAEPVDPLA ILETLPHPEH VPVAPVTPLD TTGLDEIGER YVLASEKYVE QGLMLAANDA CMEVIRLNPE YLPIHLRLGE IYERDGRKDE ALIKYQLLID TYVARGEPRR AIDVYYRLIE LSPDTIMPRS RLAELLRADG RNEEAAQQLA IVAGAYFRMG QTTKALEEYR RALQWSPSNA ELHAQYGQAL LKLDRAEAAL VAFRRALELD QQNPVHIARI NMALAIMGEQ PVAVWQSLAT LLDQLKQHPQ RLNEVQSEYR ATLLVADLPI LHYILGIVQQ HAGQHPSALL EFEQAIELLN AENDPTLTPH LVYQAMADSH IALGQASEAL RQLQHALDLA PAPPPPENAR YPFSLPLSQG EIVRRMAEAY AAVGDLASAE RALQEAKQFL PYDRAIYTKL SDIYFRQGRL NEALTQLDEL ATHYEQRQML DRAIEALENA LRLAPNNIPI SHRLAKMYIR RGYLDKGVEA LTRVAELQRK EGQIKDAIAN LQQAAEVHWT LGRQEEARAL YDKIVHIAPN DIEARQWLSF MYTLAGMTRE AVAQKKQIIR ILLQRRDLDN AIAEMHQIYG LDQNDTDNLF QLGDALMRRQ EYEQAIRIYT RLAKVPGVEI ERVEALQAAA RRMLEQQQAG NR
|
| |