Gene Caul_5221 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_5221 
Symbol 
ID5897391 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010335 
Strand
Start bp143299 
End bp145758 
Gene Length2460 bp 
Protein Length819 aa 
Translation table11 
GC content69% 
IMG OID641555324 
Productconjugal transfer ATPase TrbE 
Protein accessionYP_001676655 
Protein GI167621870 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3451] Type IV secretory pathway, VirB4 components 
TIGRFAM ID[TIGR00929] type IV secretion/conjugal transfer ATPase, VirB4 family 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGTTTC TCAACGAGTA TCGCGCGCGC TCGGGCCGGC TTTGCGATCA CCTCCCCTGG 
GCGCTGCTGA TCGCGCCGGG CGTGGTGCTC AACAAGGACG GCGCCTTTCA GCAGACCCTG
GAATATCGCG GGCCGGACCT GGCCAGCGCC ACTCCAGCCG GGCTGATGGC GGTGCGGGCC
CAGCTCAACA ACGCCCTGCG CCGCCTGGGC TCGCGCTGGT GCTTGCATGT CGAAGCCCTG
CGCGCGCCGT CACAGTCCTA TCCGGCGGCG GCCTTCCCCG ACCCGGTCAG CCACATCATC
GACGAGGAGC GTCGCGCGGC CTTCGAGGCG GAGGCCGCTC ACTTCGAGAC CCGCTACTTC
CTGACCTTCA CCTTTCTGCC GCCCGACGAC GCCATCGGCC GGGCCGAAAG CCTGCTGGTC
GAGAACCTTC CTCAAGGCCG CGGGGCTCAG GCGCTCTACC GCCTGGCGCT GGACGAGTTC
GTCTCGACCG TCGGCGGCGT GCGCGAGATC CTGGCGGGCG TCTTCCCGCT GGTGCGCCGC
TTGGATGACG CCGAGACCCT GGCCTATTTG CACGCCTGCG TCTCGACCAA GGCCCAACCG
GTGTCGCCGC CCGACCCGCC GGCCTATCTC GACGCCGTCC TGACCGACGA TGATTTTCAA
GGCGGCCTCT TTCCGAGGCT GGGCGGGGCC TATCTGCGCA CCATCTCGAT CCGCGCCTAT
CCGGCCGCCT CCTGGCCCGG CATGCTCGAC CAACTCAACA GCCTGGGTGT GCCCTATCGC
TGGGTCTGCC GCTTTCTGCC TCTGGACAAG GAGGACGCCC GGCGCTCGAT CACCGCCTTG
CGCAAGCGCT GGTTCGCCAA GCGCAAAGGC ATGCTGGCCT TGCTTAAGGA GGCCATCACC
CACGAGCCGT CCCTGCTGGA GGACCCCGAC GCCCTGCAGA AGACCCAGGA CGCCGACGCG
GCCCTGATGA TCCTGGGCGG CGACGCGGCC AGCATGGGCT ACCTGACCCC GACCATCACC
CTGATCGACC GCGATCCCGA TCGCTTGGCG CTGAAGGCCA GGCTGGTCGA AGGCGTGATC
AACCGGGCCG GCTTCGTCAG CAAGCTGGAG GACCTCAACG CCGTTGAAGC CTGGTTGGGC
AGCCTGCCGG GGCAGGCCTA TGCGGATCTG CGCCGACCGC TGGTGTCCAC GCTCAATCTT
TGCGACCTGC TGCCGGTCTC GGCGATCTGG CCGGGGCCGA AGATCAACGC CCACCTCACC
GAGGAGGCGC GCAAGCAGGG CGAGCTTGGC GACCAGCCGC CGCTGATGCA TGCGCGCACG
GCGGCGACCA CGCCGTTCCG CCTGGACCTG CACCAGGGCG ATGTGGGTCA CACCTTCGTG
GTCGGGCCGA CGGGCGCGGG CAAGTCGGTG CTGCTCAACA CGCTGGCCCT GCAATGGCGG
CGCTATCCGA AGGCGCGGGT CATCATCTTC GATAAGGGCC GCAGCAGTCG CGCCGCGACC
CTCCTGGTGG GGGGCGGGTT CTACGACCTG GGTCCCGCGG GCGAAGATCT GGCCTTTCAG
CCTCTGGCCG AGATCGACCG GCCCGAGGAG CGGATCTGGG CTCAGGACTG GATCCTGGAT
CTGCTCGGCG CCGAAGGCGC GATCGTCAAC CCGGCGGTCA AGGACGAGCT CTGGGCGGCC
CTGGCCAATC TCGCCGCCGG GCCCCGCGAG CAGCGCACCC TGACGGTTTT GGCCGCCACC
ATCCAGGATC ACGCCGTCAA GGCCGCGCTC AAACCGTTCA CCCTGGCCGG CGCGCACGGC
CGTCTGCTGG ACGCGGGAAC CGAGACCCTG ACGACCCGCG ACTGGCAGGC GTTCGAGCTG
GGCGCCCTGA TGGAAAGTCC CGCCGCCCTG GCGCCGGTGC TGACCTATCT CTTCCATGTG
CTCGAGCGCG GGTTCGATGG CCGCCCGACC CTGATCGTCC TGGACGAGGC TTGGCGCTTC
TTGGAGACGA CCGCCTTCGC CAAGCGCATC CGCGAGTGGC TGTGGACGGT GCGCAAGCTC
AACGTCTCGG TGATGTTCAG CACGCTTAGT TTGTCGAGCG TCACTGACAG CCCGATCGCG
CCGGCCCTGC TCGAAGGCTG CCCCACCCGG ATTTTCCTAC CCAATCCCGA GGCCCGCACG
CCGCTCGTCG CCAAGGGCTA TGCCGGGTTT GGCCTCAACG ACCAACAGAT CGAGATCATC
GCCGCCGCCG CGCCCAAGCG CGAATATTAC TATCAGAGCG CGGCGGGCAA TCGCCTGTTC
GAGCTGGGCC TGGGCCCGGT GGCGCTCGCC GCCGTCGGCT CGGCCAGCGC CGCAGACCAG
GTGCTGATCT CCCGGCTGTT GGACACCGCG GGCCCGGGCG GGTTCGCCGC GGCCTTCTAT
CATCACAAGG GACTGGCCGA GGTCGGGGCG TTCCTGGAGG ACGCGCGCCG TGCGGCCTGA
 
Protein sequence
MLFLNEYRAR SGRLCDHLPW ALLIAPGVVL NKDGAFQQTL EYRGPDLASA TPAGLMAVRA 
QLNNALRRLG SRWCLHVEAL RAPSQSYPAA AFPDPVSHII DEERRAAFEA EAAHFETRYF
LTFTFLPPDD AIGRAESLLV ENLPQGRGAQ ALYRLALDEF VSTVGGVREI LAGVFPLVRR
LDDAETLAYL HACVSTKAQP VSPPDPPAYL DAVLTDDDFQ GGLFPRLGGA YLRTISIRAY
PAASWPGMLD QLNSLGVPYR WVCRFLPLDK EDARRSITAL RKRWFAKRKG MLALLKEAIT
HEPSLLEDPD ALQKTQDADA ALMILGGDAA SMGYLTPTIT LIDRDPDRLA LKARLVEGVI
NRAGFVSKLE DLNAVEAWLG SLPGQAYADL RRPLVSTLNL CDLLPVSAIW PGPKINAHLT
EEARKQGELG DQPPLMHART AATTPFRLDL HQGDVGHTFV VGPTGAGKSV LLNTLALQWR
RYPKARVIIF DKGRSSRAAT LLVGGGFYDL GPAGEDLAFQ PLAEIDRPEE RIWAQDWILD
LLGAEGAIVN PAVKDELWAA LANLAAGPRE QRTLTVLAAT IQDHAVKAAL KPFTLAGAHG
RLLDAGTETL TTRDWQAFEL GALMESPAAL APVLTYLFHV LERGFDGRPT LIVLDEAWRF
LETTAFAKRI REWLWTVRKL NVSVMFSTLS LSSVTDSPIA PALLEGCPTR IFLPNPEART
PLVAKGYAGF GLNDQQIEII AAAAPKREYY YQSAAGNRLF ELGLGPVALA AVGSASAADQ
VLISRLLDTA GPGGFAAAFY HHKGLAEVGA FLEDARRAA