Gene Caul_2025 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_2025 
Symbol 
ID5899480 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp2167970 
End bp2170006 
Gene Length2037 bp 
Protein Length678 aa 
Translation table11 
GC content68% 
IMG OID641562514 
Productconjugal transfer coupling protein TraG 
Protein accessionYP_001683651 
Protein GI167645988 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3505] Type IV secretory pathway, VirD4 components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGCCCA CCAAGCTCCT CATCGGCCAG ATCCTCGCGG TCTTCGCGAT CATGATTCTG 
GGCGTCTGGG CCGCCACCCA GTGGGCCGCC GCCATGCTGG GCTACCAAGA CCGCTTGGGC
GCGCCCTGGC TCGAGGTGCT GGGTCAGCCG CTCTACCGGC CCTGGCAGCT TTTCGACTGG
TGGTACCACT ACGAGGCCTA CGCGCCGCTG GTCTTCGACA AGGCCGGGAT GTTGGCGGGC
GCCAGCGGCT TTCTCGGTTG CGCCTGCGCC ATCGTCGGAT CGCTGTGGCG CGCCCGCCAG
ACCGGGCAGG TCACGACCTA TGGATCCTCG CGCTGGGCGA CCCGGTCCGA GGCGGCCGCC
GCTGGCCTGT TCAAGGATGA CGGGGTCTTT CTCGGGCGAC TGGGCGCCGA CTATCTCCGC
CACGCCGGCC CCGAGCATGT CATGGCCTTC GCGCCGACAC GGTCGGGCAA GGGCGTGGGT
CTGGTGGCGC CCAGCCTGCT GTCTTGGACG GGCAGCGCCG TCGTCCACGA CATCAAGGGC
GAGAACTGGA CCCTGACCGC CGGCTGGCGG GCGCGGTTCT CCCATTGCCT GCTGTTCAAT
CCGACCGACG CCCGGTCGGC CCGATACAAT CCACTGCTGG AAGTGCGCAA AGGCCCTGAC
GAGGTCCGCG ACGTTCAGAA TATCGCCGAC GTGCTCGTAG ACCCGGAAGG GGCTTTGGAG
CGCCGATCGC ACTGGGAAAA AACCAGCCAC AGTCTGCTGG TCGGCGCGAT CCTCCACGTC
CTCTACGCCG AGGAGGAAAA GACCCTCGCA CGGGTCGCCA CCTTCCTCTC CGACCCTCAG
CGCTCCTTCG CCCACACGCT GCGCCGGATG ATGGCGACCA ACCATCTGGG CACGCCGCAG
GCGCCCAAGG TTCACCCCGT GGTCGCCAGC GCGGCCCGAG AGGTCCTCAA CAAGGCGGAG
AACGAACGCA GCGGCGTGCT CTCCACCGCC ATGTCGTTCC TCGGGCTCTA TCGGGACCCG
ACCGTCGCCC TGACGACTTC GGCCTGTGAC TGGCGGATCG CCGATCTCGT CGACGCCCCG
TCGCCGGTCA CCCTCTATCT GGTCATCCCG CCGTCCGACA TCTCGCGGAC CAAGCCCCTG
GTGCGTCTGG TGCTCAACCA GATTGGCCGC CGTCTGACCG AGCGTCTGGA GGGCGATCCG
AAGAAGAGCC GCAAACACCA GCTGCTGATG ATGCTCGACG AGTTCCCGGC CCTGGGACGG
TTGGACTTCT TCGAGACGGC CCTGGCCTTC ATGGCCGGCT ACGGCATCCG CGCTTACCTG
ATCGCCCAGT CCCTCAACCA GATTTCCAAG GCCTACGGCG AGAACAACGC GATCCTGGAC
AATTGCCATG TCCGGATCGC GTTCTCCTCC AACGACGAGC GCACGGCCAA GCGAATCTCC
GACGCCCTGG GCACGGCGAC CGAGCAAAGA GCGCAGCGCA ACTACGCTGG CCACCGGCTC
TCACCCTGGC TCAGCCACGT CATGGTCAGC CGCCAGGAAA CGGCCCGGCC GCTGCTGACC
CCAGGCGAGG TCATGCAGTT GCCGCCCGAC GACGCCCTGG TGCTGGTCTC GGGCCTGGCG
CCGATCCGCG CCAAGAAGCT GCGCTACTAC GAGGACCGCA ATTTCACCGG TCGCGTGGCG
CCGCCGCCGG CGCTTGTGGA CGGCCCCTAT CCCGACTGCC CGCCCTCGCG CGGTGACGAC
TGGGGCGGCC AGGTGCGCGG GCCCCATGCC GGCCTGGCCG CCGACGACGA TGCGCTCGAC
GCCGAAACCG ATGGCGGCCT GCAGCAGGAG CGTCATCCCG GGTTGTTCGA CGAGGTCGTC
GCCCAAGCGC CCGCCGACGA GACCGAACTC GGGCTGGGTC TGGCTGATGA CGACGGCGAC
TTGGTCGCCG ACCGCCAGGC CGTTGATCGC GCCCGCATGG GCCAGGCCGT CACCCGCGCC
CACGCCATGA ACGAGGGGAG CGATCGCGGC GACAACGACC TCCTCCCGGC GTTCTGA
 
Protein sequence
MTPTKLLIGQ ILAVFAIMIL GVWAATQWAA AMLGYQDRLG APWLEVLGQP LYRPWQLFDW 
WYHYEAYAPL VFDKAGMLAG ASGFLGCACA IVGSLWRARQ TGQVTTYGSS RWATRSEAAA
AGLFKDDGVF LGRLGADYLR HAGPEHVMAF APTRSGKGVG LVAPSLLSWT GSAVVHDIKG
ENWTLTAGWR ARFSHCLLFN PTDARSARYN PLLEVRKGPD EVRDVQNIAD VLVDPEGALE
RRSHWEKTSH SLLVGAILHV LYAEEEKTLA RVATFLSDPQ RSFAHTLRRM MATNHLGTPQ
APKVHPVVAS AAREVLNKAE NERSGVLSTA MSFLGLYRDP TVALTTSACD WRIADLVDAP
SPVTLYLVIP PSDISRTKPL VRLVLNQIGR RLTERLEGDP KKSRKHQLLM MLDEFPALGR
LDFFETALAF MAGYGIRAYL IAQSLNQISK AYGENNAILD NCHVRIAFSS NDERTAKRIS
DALGTATEQR AQRNYAGHRL SPWLSHVMVS RQETARPLLT PGEVMQLPPD DALVLVSGLA
PIRAKKLRYY EDRNFTGRVA PPPALVDGPY PDCPPSRGDD WGGQVRGPHA GLAADDDALD
AETDGGLQQE RHPGLFDEVV AQAPADETEL GLGLADDDGD LVADRQAVDR ARMGQAVTRA
HAMNEGSDRG DNDLLPAF