Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cag_0341 |
Symbol | |
ID | 3748061 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium chlorochromatii CaD3 |
Kingdom | Bacteria |
Replicon accession | NC_007514 |
Strand | - |
Start bp | 384212 |
End bp | 386470 |
Gene Length | 2259 bp |
Protein Length | 752 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 637772868 |
Product | TPR repeat-containing protein |
Protein accession | YP_378657 |
Protein GI | 78188319 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG5010] Flp pilus assembly protein TadD, contains TPR repeats |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.132358 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGTAA GGAATCTTGT GCTGTTTACA CTGATGGCAG TTTCAGCGGA GGGCTATGCT GCTGGTACTA AGGCCAAATC TATTGTAAAA ACACCCTCCG CAGCAACACA AGCAGCCGAA ACGATGCAAG CGTTGCCATT AGATCGCCAA ACAGCGTTAA ATTTAGCACA GAGTTACCTT GCTAATGGAT CAAGCAGGCA AGCTGAGCTT ATTTTAAGCA AGCTTGTACT ACTTTACCCT GACGATGAAG AAATTTTAAG AGAAACTATA TCGCTTTACG AAAAAAGTAA TAGAGCAGAG CAAACTCTAC CTCTCTACCA GCATTTGTTA CAGCTACGAC CAAATGATTT AGAGCTAACA CTTGCAAGTG CAAGAGCCTA TTCATGGACA GGAAGAAAAG CTGAATCAAT AGCTCTGTAC GAAAAAGTAC TCAAAGCAGG TAATGCCTCC GAAAAAGTTG TAACTGAATA TGCTGATTTT TTATATGCAG ATAAGCAATA CCAAAAAGCT ATTGATCTCT ATAAAAGCGT TGGACAAAAG GGTAAGCTAA GTAAGCAGCA TATGCTTAAT ACGGTGAATG GTTTTATTGC ATTAAAAAAG TTTGATGAGG CAGCTAAAAT TTGTAATGCA AATGTTCCCC TTTACCCTCA GGATACCGAC TTTTTAAGAT TAGCGGCTGA TATCAATTTT AATGCAAAGC AGTTTGAAGA GGCAGCAGGT CATTATCGCC AGTTATTGCT TAAGAATCCA GACGATCCGG GTGCCTATAG CAAACTTGCA GATATTGCAA TGGCGAAAAA TGATTTCACT GAAGTTGCTC GCCTAAGCCA TAAAATTTTA GCCTTGATTC CTGATCACAA AACAGCAATG CTTTCTTTAG CGAGAGTTTC AAGCTGGCAA GGCGATTTTA CAACTTCGTT AACTTATTAC GATAAACTTA TTGCCTCTCC CAACCCTGAG CCTTTTTATT ATCGGGAAAA AGCACGAGTT TTAGGATGGA TGGGGGATTT TAAACAAGCT TTAAGTGTTT ATAAAAGCGC TGTTCAAAAA TGGCCTGACG ATAAAGCAAT AAGCGCTGAG GCTGAAGCCA AGAAAAATTA TTATAACCAC ACCTATCGCC CTGCTGTTAA AGCATATAAT GCTTGGCTCT TAGCAGAACC GCAACAACCC GAAGCTCTGT TTGATCTTGC TCAGCTTTAT GCGCAATATG GAAAATGGAA TAATGGGTTA AACACCTACA ACTCACTTTT AAGTCAAATA CCTGCTCATC GTCAAGCTGC ACTTGCTAAG CAAAAAATTG ATTTTGCGGC ATCAAGAATG TTTGTGCGTA GTGGTGTAGA ATATTTTAGT GCAAAAACAA AGTTTATTGA TGCTACAAAT CATAGGCAAG CTGATACAAA AAGCACTTCA ATCTATTCTT CGCTTACATA TCCAATAAAT GAGCGAGTGT CTGCTTTTGT AAATCTTGAT AGCAAGTCAT ACGATTTTCG TATTGCTAAG CCAAATACTC CAAAAAATCC TGTTACTTAT GGCTTGATGG CAGGTGCAGA ATATCGCAAT ATGCCGAATA TTGCACTCAG TGCTGGATTG GGTATGCGTA TGAATCCCGG TGATGTAGAT AATGGTCTCA CTGGTTTTAT TAATGCGAAT AGTCAGCCCG TTGATAACCT TCATGTTGGA GTAACGTTGC GCAATGATGA TATTGTAACC AACACCTCTT CATTTAATAA TCAACTTGAA GCCACGCGTC TGCAAGGACG AGTTGCTTAC AATGGATATC GCCGTTGGCA AGCTGGTATG GATATTGCTT TTGATAGCTA TGCAGACAAC CAATCTTACG ATAATAGTAG CCTAACAGTT GGTGCTGATG TTGTAGCCCA CTTGCTATAC GAACCACAAC GTTTAAGTGT ATCGTACCGT TTACAAGAAT ATGGTTTTGA TAAGAACCAT GCGAATCATC CACAATACTA CAACTATTTT TGGACGCCTA AATCATATAC AACACATACG TTTGGGCTTG AATGGCAACA TTACCTTAAT AGAGAGCGTT TCCATGGTTC TAATAACACA TACTATGATA TTGCTTTCCG TGTTGGACTT GAACAAGAGG GTGATATTTC ACGCCAAATC CATGCAAGCA TTAACCACGA CTGGAATAGC CGTCTTGCCA CATCGTTAGA GGGGCAATAT ACATGGGGAA CAAGTGCTGA AATTTATCAG GACAGCATGG TCAAAGCTGA GTTCCGCTGG TTTTTATAA
|
Protein sequence | MKVRNLVLFT LMAVSAEGYA AGTKAKSIVK TPSAATQAAE TMQALPLDRQ TALNLAQSYL ANGSSRQAEL ILSKLVLLYP DDEEILRETI SLYEKSNRAE QTLPLYQHLL QLRPNDLELT LASARAYSWT GRKAESIALY EKVLKAGNAS EKVVTEYADF LYADKQYQKA IDLYKSVGQK GKLSKQHMLN TVNGFIALKK FDEAAKICNA NVPLYPQDTD FLRLAADINF NAKQFEEAAG HYRQLLLKNP DDPGAYSKLA DIAMAKNDFT EVARLSHKIL ALIPDHKTAM LSLARVSSWQ GDFTTSLTYY DKLIASPNPE PFYYREKARV LGWMGDFKQA LSVYKSAVQK WPDDKAISAE AEAKKNYYNH TYRPAVKAYN AWLLAEPQQP EALFDLAQLY AQYGKWNNGL NTYNSLLSQI PAHRQAALAK QKIDFAASRM FVRSGVEYFS AKTKFIDATN HRQADTKSTS IYSSLTYPIN ERVSAFVNLD SKSYDFRIAK PNTPKNPVTY GLMAGAEYRN MPNIALSAGL GMRMNPGDVD NGLTGFINAN SQPVDNLHVG VTLRNDDIVT NTSSFNNQLE ATRLQGRVAY NGYRRWQAGM DIAFDSYADN QSYDNSSLTV GADVVAHLLY EPQRLSVSYR LQEYGFDKNH ANHPQYYNYF WTPKSYTTHT FGLEWQHYLN RERFHGSNNT YYDIAFRVGL EQEGDISRQI HASINHDWNS RLATSLEGQY TWGTSAEIYQ DSMVKAEFRW FL
|
| |