Gene Caul_2168 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_2168 
Symbol 
ID5899623 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp2352137 
End bp2353858 
Gene Length1722 bp 
Protein Length573 aa 
Translation table11 
GC content73% 
IMG OID641562659 
ProductTPR repeat-containing protein 
Protein accessionYP_001683794 
Protein GI167646131 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG5010] Flp pilus assembly protein TadD, contains TPR repeats 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.28547 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.066147 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGCGTA TCAAGCTTTT AGCCGCCCTC TTGTCGGCCT CGGCCCTGAC CGCCTGCGCC 
ACCGCGACGC CGGGTCCGGC CCTGCGCGGC CACGTTTCCA CCGACTCCGG CTGGGGCGAC
ACCCGCTCGG CCTACGGCCA GTTCCTCGCC GGGCAATCGG CCCTGCAGAA CGGCTCCAAC
GCCCAGGCCG CCGCCTATTT CGACGAGGCG CGCGAGCTGG GCGAGGATCC CGGCGTGCTG
ATCGACCGCG CCTTCACCGC GGCCTTGCTG GCCGGCGACG TCCAGCGGGC CGCCGCCATC
GCCCCGCGCG GCGGCGAGAC CAACGAGGCC GTCCGCCGCC TGGGCGTCCT GACCCGCGTG
GTCGACCTGC TGGCCAGCGG CAAGGGCCAC GACGCCCAGA CCCTGCTCAA GAGCGAGACG
ATCGGCTTCC CGCATCGCCC GGCCGCGGTG CTGCTGGCCC CCTGGCTGGC CGCCGCCGCG
GGCGACAACA ACGGCGCCGT GGTTCGTCCC GAGCTGCAGG GCGACCGCTT CGTGCAGTAT
TTCGGTCAAC TGGGCCAGGC GCGGCTGTAT GAGCGGGCCA GGCGCTATGA CGAGGCCGAG
ACCGACTACA AGGCGCTGAT GGCGGCCGAG AACGCCCGCG CCCTGTTCGT CGAGGACTAC
GGCGCGTACC TGGAGCGCCG CAAGCGCCAC GACGACGCCG TGGCGCTGTA CGACTCGGCC
CTGACCCGCG ATCCGGACGA CCAGGGGCTG CTCAAGGCCC GCGCCCGCGC CGCCGCCCAC
GGTCCGGCCC CGGCCATGCC GACCGAGAAG CAGGGCGCCG CCCACGCCCT GGTCGCCTGC
GCCGCCACCT TCGCCCAGGA GCGCCAGAGC CAGTTCAGTC TGGCCTATCT GCGCCTGGCG
CTTCGGCTGG ACCCCAAGCG CGACGACGCC TGGCTGCTGG TCGGCGACCT GCTGGCCCAG
GACGACGACG ACGCCGGCGC GCGCGAGGCC TACGCCAAGG TGCTTCCCGG CTCGCCCCGC
TATGTGGCCG CCCAGTCCAA GCTGGCCTGG AGCTATCAGA GCGGCGGCGA CAAGCCCAAG
GCCATCGCCC TGGCCCAGCA GGTCGCCGCC GCCGCGCCCA AGGACCGCGA CGCCCAGATC
ACCTATGCCG ACCTGCTGCG CGCCAACGAT CGCTGGGCCG AGTCGGCCGC GGCGCTGGAC
CCCCTGATCG CGTCGGAGGC CGCCAAGCCC GACTGGCGGC TGCTCTACAT GCGCGGCATC
GCCCTGGAGC GCGCCGGCCG CTGGAGCGAG TCCGAGCGCG ACCTGCTGGC CGCCCTCAAG
CTCAGCCCGG ACGATCCCGA CCTGCTGAAC TATCTGGGCT ATTCCTGGAT CGATCGCGGC
GAGCACCTGC CCGAGGCGAT CGCCATGGTC CAGAAGGCCG TCGAGGCCCG CCCGCAGTCG
GGCGCCATGC TCGACTCCCT GGGCTGGGGC TACTACCGCC AGGGCGACTA CAAGACCGCC
GTGGCCAAGC TGGAGCAGGC CGTCGAGCTC GAGCCCGGCG ATCCGGACGT CAACGGCCAC
CTGGGCGACG CCTATTGGCG CATCGGGCGC AAGGTCGAGG CGCGCTACCA GTGGCAGCGC
GTGCTCAGCC TGGAGCCGGA CGAAAAGCAG AAGGCCGAGG CGATGGCCAA GCTGAAGGAC
GGGCTGGACG GTCCGCCGGC CAAGGTCGCC TCGGCGGGGT AG
 
Protein sequence
MTRIKLLAAL LSASALTACA TATPGPALRG HVSTDSGWGD TRSAYGQFLA GQSALQNGSN 
AQAAAYFDEA RELGEDPGVL IDRAFTAALL AGDVQRAAAI APRGGETNEA VRRLGVLTRV
VDLLASGKGH DAQTLLKSET IGFPHRPAAV LLAPWLAAAA GDNNGAVVRP ELQGDRFVQY
FGQLGQARLY ERARRYDEAE TDYKALMAAE NARALFVEDY GAYLERRKRH DDAVALYDSA
LTRDPDDQGL LKARARAAAH GPAPAMPTEK QGAAHALVAC AATFAQERQS QFSLAYLRLA
LRLDPKRDDA WLLVGDLLAQ DDDDAGAREA YAKVLPGSPR YVAAQSKLAW SYQSGGDKPK
AIALAQQVAA AAPKDRDAQI TYADLLRAND RWAESAAALD PLIASEAAKP DWRLLYMRGI
ALERAGRWSE SERDLLAALK LSPDDPDLLN YLGYSWIDRG EHLPEAIAMV QKAVEARPQS
GAMLDSLGWG YYRQGDYKTA VAKLEQAVEL EPGDPDVNGH LGDAYWRIGR KVEARYQWQR
VLSLEPDEKQ KAEAMAKLKD GLDGPPAKVA SAG