Gene Caul_5214 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_5214 
Symbol 
ID5897412 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010335 
Strand
Start bp137820 
End bp139046 
Gene Length1227 bp 
Protein Length408 aa 
Translation table11 
GC content73% 
IMG OID641555317 
Productconjugation TrbI family protein 
Protein accessionYP_001676648 
Protein GI167621863 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG2948] Type IV secretory pathway, VirB10 components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCGCCG CGCGCGACGG CGCGCCAGCC TCTGATCCGC AAGATCTCAG CGCCGCCGCC 
AAGGAGCCGA CGGCCACGGT GCTGCGCGCG CCCCGGCCGC CGATCACGCG TTACCGGCCC
GCCGTGATCG GCGCGGCGCT GCTGGCGGTG TTGTTGCTGG TGGGTCTCGG CTTCCTGATC
GCTTTTGGCG GCGGCCACAA GCGCCCGTCG ACCGCCGCCG CAAGCCCGGC GCCCGCCGAG
CCTGCCACCA CGGCCACGCC GATCGACGAG CGCTTGCCTG CGACCTACGG GGAGCTGGGG
CCCGCGCCGG CCCCCGCAAG CCTCGCCGCC GCTCCGGGGG CGGCCAGTCC CAGCCAGGAG
GCCGGCGTCA CGACCAGCAG CGGCGCTTCG ACCGGCGGTG ACGGCGGCGC GCGACAGCGA
CAGCTTGAAG ACCAGCGCGC CGCGCAAGGC TCGGCGCCGT TCTTCGGCGG CGCGGCCGGA
TCGACCGCCC AGGCCGCGGC CGCGCCGAGC CTGCCCCCGC TGGCCTTCGC CGGACCGGAG
GCCGCCCCCA CCCCGGCCGC CGGCCTCAGC GCCAAGGAGG GGTTCATCGC CCGGGCCTCC
GCGCCCCAGG CCAACTACGC GCCGGGTTTG CCCCAACCAC CCCTATCGCC TTACGAGGTC
AAGGCCGGCT CCGTGATCGC CGCGGCCCTG GTCACCGGAC TCAACTCCGA TCTGCCCGGC
ATGGTCGTGG CCCAGGTCAC CCAGCCGGTG TTCGACCACG CCACGGGCCG TGTCATGCTC
ATTCCCCAGG GCGCGCGCCT GATCGGCAAG TACGACAGCC AGGTCGGTTA TGGGCAGGAC
CGGGTGCTGC TGGTCTGGAC TCGGCTGATC TATCCCAGCG GCCGGTCGGT GGACCTTGGC
GCGATGACCG GGGCCGATGT CACCGGGGCC GGCGGACTAT CGGACCGCAC CGACACCCAC
CTTCCGGTGC TGGCGCGGGC CATTGGTCTT TCGACCCTGA TCTCGATCGG CGGCGCGGCC
GCTCAGAACA GCGTCGCGCG CGGGAGCGAC AACCTGGTCC TGCAAGACGG GGCCGGCGGG
ATCGCCTCGC AGGCCAGCCA GACGGGCCAG AGGCTCGTCG AGCGCGATCT GCAACGCAAT
CCGACCTTGC GCATCCGGCC GGGTTTCCCG GTTCGAGTGA TGGTCGACAA GGATCTCATC
CTGCCACCCG AAGGAGCGCT TCAATGA
 
Protein sequence
MSAARDGAPA SDPQDLSAAA KEPTATVLRA PRPPITRYRP AVIGAALLAV LLLVGLGFLI 
AFGGGHKRPS TAAASPAPAE PATTATPIDE RLPATYGELG PAPAPASLAA APGAASPSQE
AGVTTSSGAS TGGDGGARQR QLEDQRAAQG SAPFFGGAAG STAQAAAAPS LPPLAFAGPE
AAPTPAAGLS AKEGFIARAS APQANYAPGL PQPPLSPYEV KAGSVIAAAL VTGLNSDLPG
MVVAQVTQPV FDHATGRVML IPQGARLIGK YDSQVGYGQD RVLLVWTRLI YPSGRSVDLG
AMTGADVTGA GGLSDRTDTH LPVLARAIGL STLISIGGAA AQNSVARGSD NLVLQDGAGG
IASQASQTGQ RLVERDLQRN PTLRIRPGFP VRVMVDKDLI LPPEGALQ