Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_4970 |
Symbol | |
ID | 5902432 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 5374010 |
End bp | 5375611 |
Gene Length | 1602 bp |
Protein Length | 533 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 641565491 |
Product | GSCFA domain-containing protein |
Protein accession | YP_001686588 |
Protein GI | 167648925 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG5010] Flp pilus assembly protein TadD, contains TPR repeats |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.432309 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTTTGG TGCACGTTCC GGCTGAACAA GCGTTGGGTT CAAAGAACCG CTTTGACCGA TGGGGAAAGG CTGCCGAACG TCTGAAGCCC GAGTGTTGGC CGCACGTCAG CACGCCCTTC GCGCTTCATC GCGGCGCGAA GGTGTTCACC ATAGGCTCGT GCTTTGCCCG AAACATCGAG GAACGCCTAG CTCGAGTGGG GTTCGACATC CCCATGCTCG CGTTCAGCGC GCCTCAGTCT GAACATGCAG GCGCGCGGGC GGCCGGCATT CTGAACAAGT ACACGCCGGC CAGCATTTAC CAGGAGATCG CGTGGGCCGC CGATATCTAT GAGCGAGATA GCGTTCCCAC CCGAGCCGAC TCCGAGAAGT TCCTTTACCT GCTCGATGAT GGTTCAGCGA TCGACAACAA TCTGGCGGAC CACGTCTCGG TTGGCCTCGA CCGATTTTTT GAGCGTCGCG TCGATATTTA CAGCGTCTTC AAACATGCCT TTGATGCCGA ATGTGTTGTT ATCACGCCTG GCCTAGTGGA AGCGTGGTGG GACACCGAGC GTGGGATATA TATCCAGGAT CCGCCATGGC CGCGCGAGCT GAAGAAGTTC CGAGGGCAAT GCGAGTTCGT GCAACTCGAC TATACGACGG CGTTTGACTA CCTGCAGCGA ACAATCGATC GTATCCGATC GATCAATCCT GACGCGAAGT TCCTGATCAC CACGTCGCCG GTTCCGCTTG GCAAGACGTT CACCGACGAC GACATCATCG TCGCGAATAG CTACGCCAAG TCCACGCTGC GCGCGGCGTG CGGTGATCTG GTCAAGCGCA ATGACAACAT CGGTTATTTT CCGAGCTTCG AGAGCGTCAT GCTCTCCAAG GGCGATGGGG TTTGGGAAGA TGACGGTATC CACGTCACTC AGGCCTTCGT CAGCAGCATC GTGGCCCACC TGACGCAAGC CTATTGCCCG GACGTCAGCG AAGGTGACCG GCTTTTCCTG TCGAGCCTGT CGAGCACTGA CACGAATGAA CGTCTGGCGC TGGCCAAGAG GGCCGTTGAG CTGGAACCCG AGCGCCCCGA ATTGCTGGAC CATCTGGGCA CTCTCTATTG CAACGCTCAG GACTTCGAGG CGGCTGTTGG TGTGCTTCAA CGGGCGGTGG ACCTGCGCCC CGACTGGGAA CATCGCTATC ATCTGGCCAT GGCTCTGCAG GGTGTTCGGC GGTTCCGCGA GGCCGGAGAA CTGCTTGAAG TTCTAGTCGT CGAAAACCCG GACTCAACGG ACGCGGCAAC CCGGCTCAGC CACTCCCTCA TCATCCTAGG ACAAGCGCAG CGCGCCAGGG CGTTCTTGGA GCAGCGGATC GCCTCAGCGC CATCATCGGC GCTTTACTAC TGGCTCAGCA CGGCCATGGG CCATGCCGGC GACAACGCCG ACGCCGCGTT GATGGCGGAG AAATCTATCG AACTGGACCC GGGCAACCCT CATAACTGGT ACTTGGCTGG CACGTATCAT GCCAAGGCAA ACAGAAAAGC GCCTCGACCC TTCTTCGAGA AGGCGTTGGA GATCGCGCCC GACGTCAAAG CCTTCCAAGA CGCCCTGAAG CCCCAAATAT AG
|
Protein sequence | MALVHVPAEQ ALGSKNRFDR WGKAAERLKP ECWPHVSTPF ALHRGAKVFT IGSCFARNIE ERLARVGFDI PMLAFSAPQS EHAGARAAGI LNKYTPASIY QEIAWAADIY ERDSVPTRAD SEKFLYLLDD GSAIDNNLAD HVSVGLDRFF ERRVDIYSVF KHAFDAECVV ITPGLVEAWW DTERGIYIQD PPWPRELKKF RGQCEFVQLD YTTAFDYLQR TIDRIRSINP DAKFLITTSP VPLGKTFTDD DIIVANSYAK STLRAACGDL VKRNDNIGYF PSFESVMLSK GDGVWEDDGI HVTQAFVSSI VAHLTQAYCP DVSEGDRLFL SSLSSTDTNE RLALAKRAVE LEPERPELLD HLGTLYCNAQ DFEAAVGVLQ RAVDLRPDWE HRYHLAMALQ GVRRFREAGE LLEVLVVENP DSTDAATRLS HSLIILGQAQ RARAFLEQRI ASAPSSALYY WLSTAMGHAG DNADAALMAE KSIELDPGNP HNWYLAGTYH AKANRKAPRP FFEKALEIAP DVKAFQDALK PQI
|
| |