Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cag_0486 |
Symbol | |
ID | 3746355 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium chlorochromatii CaD3 |
Kingdom | Bacteria |
Replicon accession | NC_007514 |
Strand | + |
Start bp | 565456 |
End bp | 568353 |
Gene Length | 2898 bp |
Protein Length | 965 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 637773020 |
Product | TPR repeat-containing protein |
Protein accession | YP_378802 |
Protein GI | 78188464 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG5010] Flp pilus assembly protein TadD, contains TPR repeats |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTAGATA ATAATTCAAC ACATATTCAG CCTGCTGGCG GTTTTGCGGC TATCAGCAAA TACAATCCGC ATCTTTGGAG TGCGGAACAA TTACGAGCTA TTTTTGTTGC ACGAACAAAC GAATTAGCTG ATTTGGTGCA GACGTTGCGC ATGGTACAAC CTGATACAGT AGCCCAACAT GTTTTGTTAG TTGGTGCGCG TGGAATGGGT AAATCCACTC TTATGCGACG TTTAGCATTA GCGGTGGAAG ATGATCCTTC GCTTTCTGCT AATTGGTTGC CGTTGCGGTT TCCCGAAGAG CAATACACCG TTGCTACACT TGGTCAATTT TGGGCTAACG TTCTTGACAG TTTTGCCGAT ACGTTACAAC ATCTTGGCGA ATCTGTTATA GCGTTGGATG CTGCTGCTGA GCGTATTGCA GCGTTACCTG TAACCCAACA GCCCGAAGCC TATATAGATG CCATTAATCA CTTTGCTGAT GAACGTAAGC AACGGTTGTT ACTTTTGGTG GATAACACGG ATATGCTGCT CCATAACATT GGGAAAGATG CTCATTGGGG ATTACGTGCA ACGTTGCAGA GCAATCCTCG CCTTTTCTGG ATTGGTGGTA GTTACCAAAG TCTTGAAGCT GAGAGCAATT ATCACGATGC TTTTCTTGAC TTTTTTCGAG TTATCAACTT GCGCCCGTTA AAAGTGGAAG AAATGCGTCA AGCGTTGCTT GCTTTAGCTG AAACCTTTGG TGGTGCTACA GCACGAAATG CAATGGTGCA CCAACTTGAT CTTCAGCCAG AGCGCTTGCC CACTTTGCGC CAACTCTCAG GTGGTAATCC ACGTACAACG GTGATGCTGT ATGAAATTCT TGCCAATGGT CAAAATGGCA ATGTACGTAG CGATTTAGAA GCGCTGCTTG ATAATATGAC GCCTCTTTAT AAAGCTCGTA TGGATAGCTT AGCTGATTTA CAACGCAAAC TGCTTGCCCA TATTTTAGAG CATTGGGCAC CTATCTCTTT TGGCGAATTG GCTGCGGTTT CTCAAGTAGC AAAAGGCACC ATTAGCCCAC AGTTGCAGCG TCTTGAAATA GAAGGACTTA TTGAAAAAAC TTCTTTGCAT GGTACTACTC GTAGCGGCTA CCAAGCGGCT GAACGCTTTT TTAACATTTG GTATTTAATG CGCTTTTCAC CACGTCGGCA GCGCAATCGT CTTGTTTGGT TGGTGGAGTT TATGCGCTTA TGGTTTAGTG GCGATGAACT CTGCTCTTTA GCAAAACAAC GAATGAGTGT TGGTAGCAAT GATTTGCGTT CAACGCATGA CTTGGAATAT GATCGAGCAT TAGCTGATGC TCTACCGCAG TTTGCGCCCG AACGCCACGC TTTGCGCTGG TCGTTGCTCA AGCTGTTACA AGAAAACAAC TCTCAGTTAG TGGAGCTTTT CGATTTTGAC GGTGAGGATA AAGAGTTTAA AGGCGCTACT GATTATCTAC GCCGTTTAGC TGCACTACCG TCACTGTTAC GCCAATGCCC TCATGCGGTA ACTGAGCAAG AGAAAACGCA TTGGGTTGAA ACTGTGCTTG GTTCGATTAG CTTAACTTTG GAAGAGAAAG AAATTGTAGC TCAAAAGGCA GAATATTTAA CTCTTTTTCA ATATGACGAG TTATTAAAGG TTTTTAGTGA AGAACAAAAA AGATGGGAAA AACAGTTTGG GGTGGCTGCT TTGCAAACGG TACGTAGTGC TGTGTTGAGC CAAGATTTTT TTTCTGATAT GCCTGATTCA CATCTTGCTT ATGAGCAAGT TCGTGTCTGT TTTGCTAACA ATAAAGAAGC GTTACGTTTT GTTAGTTTAC TTTTTTATAG CAAGCATAAA GACGAGTGGA GCTACAAAGC CCAGAAATTA GCACTGAACC TATTGCACGA TGATTCTAAA AGTGATTGGT TCCTTCGAGA AAAGCTTGTG CGTTATGAAG AAACCGAAAA AGCTTATTGC AAAGCAATTG AATTTAATAA GGAAGATGCT GTTACTTGGA ATCACGTAGG TAACTTACTA AAAGATTATC TCGGTCGTTA TGAAGAAGCA GAAGCAGCGT ATCGTCAAGC GATAGCTATA GATAAGAAGT TTGCTTATCC ATGGAATAAT TTGGGTCAGT TATTGCATTA CAATTTGAAT CGCTATGAAG AATCGGAAGC AGCGTATCGT CAAGCAATAG CATTGGATGA GAAGTATGCT TATCCATGGT TTAACTTGGG GCAACTACTG CACTATAAGC TCGAACGTTA TGAAGAATCG GAAGCGGCGT ATCGTCAAGC GATAGCTATA GATGAGAATA ATGCTTATCC ATGGAATAAT TTAGGTCAGT TATTACATGA ATGGTTAGGT CGTTATGAGG AAGCAGAAAC AGCGTATCGT CAAGCGATCG CGTTAGATGA GAAATACGTT TATCCTGTAA CAAATCTTGC TCGCTTGTTG GCTCAGCGTA ATCGCAAGGC AGAAGCCGAA ACCTATTATC GTGAAGCTGT CTTAAAAGAC ACTCAAGACA CTCAACAATT ATTCCTTCAA GCGCATCTTT TCTTAGGAAA TCGTCAGTTA GCTATGGACG CATTGCAAGC CTTAGCTGAA AAAGCGCAAA ATGGTAATCA ATATGCGTTT TACCGTCTCA AAGAACAGGT TTGGGAGTGC TATGAGCTTG GGTTAGGTGA ACGCTTAGCA GATTGGATGG CAGAAAGTAA TGTGGCAGAG TTCCTTACGC CATTTATTCA AGCTCTTTAC ACGCTTGCAG GTGTTAATGA GAAATTGCGC GACTTACCAA TGGAAAGCCA ACACATGGTA GATGAGATTG TTCGCAAAGC GCGTTTGCGG CAGGAAAAAC GAGAGGCTTG CAACATGCGG GCAAAATCAA TTCATTGA
|
Protein sequence | MLDNNSTHIQ PAGGFAAISK YNPHLWSAEQ LRAIFVARTN ELADLVQTLR MVQPDTVAQH VLLVGARGMG KSTLMRRLAL AVEDDPSLSA NWLPLRFPEE QYTVATLGQF WANVLDSFAD TLQHLGESVI ALDAAAERIA ALPVTQQPEA YIDAINHFAD ERKQRLLLLV DNTDMLLHNI GKDAHWGLRA TLQSNPRLFW IGGSYQSLEA ESNYHDAFLD FFRVINLRPL KVEEMRQALL ALAETFGGAT ARNAMVHQLD LQPERLPTLR QLSGGNPRTT VMLYEILANG QNGNVRSDLE ALLDNMTPLY KARMDSLADL QRKLLAHILE HWAPISFGEL AAVSQVAKGT ISPQLQRLEI EGLIEKTSLH GTTRSGYQAA ERFFNIWYLM RFSPRRQRNR LVWLVEFMRL WFSGDELCSL AKQRMSVGSN DLRSTHDLEY DRALADALPQ FAPERHALRW SLLKLLQENN SQLVELFDFD GEDKEFKGAT DYLRRLAALP SLLRQCPHAV TEQEKTHWVE TVLGSISLTL EEKEIVAQKA EYLTLFQYDE LLKVFSEEQK RWEKQFGVAA LQTVRSAVLS QDFFSDMPDS HLAYEQVRVC FANNKEALRF VSLLFYSKHK DEWSYKAQKL ALNLLHDDSK SDWFLREKLV RYEETEKAYC KAIEFNKEDA VTWNHVGNLL KDYLGRYEEA EAAYRQAIAI DKKFAYPWNN LGQLLHYNLN RYEESEAAYR QAIALDEKYA YPWFNLGQLL HYKLERYEES EAAYRQAIAI DENNAYPWNN LGQLLHEWLG RYEEAETAYR QAIALDEKYV YPVTNLARLL AQRNRKAEAE TYYREAVLKD TQDTQQLFLQ AHLFLGNRQL AMDALQALAE KAQNGNQYAF YRLKEQVWEC YELGLGERLA DWMAESNVAE FLTPFIQALY TLAGVNEKLR DLPMESQHMV DEIVRKARLR QEKREACNMR AKSIH
|
| |