Gene Caul_1152 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_1152 
Symbol 
ID5898607 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp1217970 
End bp1220828 
Gene Length2859 bp 
Protein Length952 aa 
Translation table11 
GC content69% 
IMG OID641561634 
Productconjugal transfer relaxase TraA 
Protein accessionYP_001682780 
Protein GI167645117 
COG category[L] Replication, recombination and repair 
COG ID[COG0507] ATP-dependent exoDNAse (exonuclease V), alpha subunit - helicase superfamily I member 
TIGRFAM ID[TIGR02768] Ti-type conjugative transfer relaxase TraA 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00271398 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000000434964 
Fosmid HitchhikerNo 
Fosmid clonabilityunclonable 
 

Sequence

Gene sequence
ATGGCGATCT ACCACTTCTC CGTCAAAGTC ATCAGCCGCG CTACAGGGGC CAGCGCCGTG 
GCCTCGGCCG CCTATCGCTC CGCCTCGCGG CTGCACGACG AGCGGCTGGA TCGCGATCAT
GACTTCACCA ACAAGAGCGG CGTCGTCCAT TCCGAGGTCA TGCTGCCGGA TGGCGCGCCC
GAGCATCTGT CGGATCGCGC AACGCTGTGG AACACCGTAG AGGCCGGCGA AAAGCGCAAG
GACGCCCAGC TCTCGCGCGA GGTCGAGTTC GCCATCCCCC GCGAGATGGA CCAGGCCCAG
GGCATCGATC TCGCCCGCGA CTTCGTCCAG CGCGAGATGG TCGATCGCGG CATGGTCGCC
GATCTCAATG TGCATTGGGA CATCGGCGCG GACGGCCTGG CCAAACCCCA CGCCCACGTC
ATGTTGTCGA TGCGGGAAGT GGGCGAGGAC GGCTTCGGCG CCAAGGTTCG CGACTGGAAC
CGCACCGAGT TGGTGGAGCA CTGGCGGGAG GCTTGGGCCG ACCACGTCAA CGAGCGGCTG
GCGCAACTCG ACATCGACGC GCGGATCGAT CACCGCAGCC TGCAAGACCA GGGCATCGAC
CTTGAGCCGC AGCACAAGAT CGGCCCGGCC GCTGGGCGCA GGGCCGACGA AGGCCTGGAG
GCTGAGCGGC TCGACGAGCA CCACGAGATC GCGCGGGCCA ATGGCGAGCG GATCATCGCC
GATCCGCGCA TTGCTCTGGA CGCCATCACC AAGCAGCAGG CGACCTTCAC CCGCCGCGAC
CTGGCGATGT TCGTTCACCG CCATTCGGAT GGGAAGGAAC AGTTCGACCG CGCCATGGGC
GCGGTGCAGG CCTCGCCCGA ACTTGTCGCT CTGGGCCAGG ACGGGCGCGG CCAGGACCGC
TTTACCAGCC GCGAGATGAT CGCGGTCGAA GATCGCCTGC ATCGGGCTTC GGCTTTGATG
GCCGAGCGCC GGGCGCATCG GGTCAGCGAA CTCGATCAGC GCCGCGCCCT GACACGCGCG
GAGCAGCGCG GTCTGGTCCT GTCCGGTGAG CAGAAGGCGG CGTTCGAGCA CGTCACAGCG
ACCCGCGATC TGGGCGTTGT GGTCGGCTAT GCCGGCACGG GCAAGTCGGC GCTGCTAGGC
GTGGCGCGCG AGGCCTGGGA AGACGCCGGC TATCGCGTTC AGGGCCTGGC CTTGTCGGGC
ATAGCCGCCG AGAACCTGGA GGGCGGATCG GGCATCGCCT CGCGCACGAT CGCCAGCCTG
GAGCATCAGT GGAGCCAGGG CCGCGAACTA CTGGATACGC GCGACGTGCT GGTGATCGAC
GAGGCCGGCA TGATCGGCTC TCGGCAGATG GAGCGGTTGC TGTCGGCGGC CGAGAAGGCC
GGCGCCAAAG TGGTGCTGGT CGGCGATCCC GAGCAGCTTC AGGCCATCGA GGCCGGCGCG
GCGTTCCGGT CGATCGCGGA GCGCCATAAC CACGTCGAGA TCACCCAGGT TCGCCGTCAG
CATGAGGACT GGCAGCGCGA CGCCACGCGG CACCTGGCCA CCGGCCGGAC CGGCGAGGCG
CTGGTGGCTT ACGAGGCGCG CGGCATGGTC CACGCCGCCG AGACGCGCGA CCAGGCACGC
GAGGAGTTGG TCGAGCGGTG GGATCGCGAC CGCGTGGCCG AGCCGGGCAA GAGCCGGATC
ATCCTCACCC ACACCAATGA CGAGGTGCGC GACCTCAATC TGACCGCGCG CGAGCGGCTG
CGCCAGGCTG GCGCCTTGGG GCAGGACGTG ACGGTCAAGG CCGAGCGCGG CGAGCGCGCC
TTGGCGGCCG GCGATCGGCT GATGTTCTTG AAGAACGATC GTGGGCTCGG CGTGAAGAAC
GGCATGCTCG GCGAGATCGA GCAGGTCTCG CCCACCCAGA TGACCGTACG ACTCGACGCG
GGGCGCTCGG TCGCCTTCGA CCTGAAGGAC TACGCCCAGG TCGATCACGG CTATGCGGCC
ACCATCCACA AGAGCCAGGG TGTCACTGTC GATCGGACCC ACGTGCTGGC GACGCCGGGC
ATGGATCGTC ACGGCGCCTA TGTCGCCCTG TCACGCCACC GCGACAGCGT GCAGCTCCAT
TACGGCCGCG ACGAGTTCGA AGACTTGGGC AAGTTGACCC GTGTCCTGTC GCGCGAGCGG
GCCAAGGACA TGGCCAGCGA CTTTGCCGAA CGGCGCGACA TTCATGTGCC GCCGTCGATG
CGGCCCAAGG AGCCGCAGCA GCGCGAGCGC GGGATGTTCG CCGGGTTCAG GCCGGTCAGT
CCCGCGCATC AGCCGGCGGC GCCGGCGCGC GATGACGATC AGCTTGCTCG ACGCCGGTCG
ATCGAGCGCC ACGCCCGCGC CGCGGCCGAC GTCATGCGCA TGAACGACCG CGGCTTGCCG
GTGCTCGCCC ATCAGCGGCG GACGCTGGAG CAGGCGCGCG AGGCCATGGG GAAGATCGGT
CCGCACGCAG CGAAGGATCT GGAGCGGGCC TATGTCCGTG ATCCGGCCTT GGCCGGCGAG
ACCGCCTCAG GGCGCACTCA GCGGGCTATC CGCGTCCTTC AGCTTGAGGC CGAGGTGAGG
GCCGATCCGC GTTTACGCGC CGACCGCTTC GTCGAGCGTT GGCAGGGTTT GGAGCGCCAG
CGCACGGCCT TGCACCGCGC GGGCGACATG ACCGGCGCCG GCCAGGTCAA GGATCGTATG
GGCGCGATGG CCAAGAGCTT GGAGCGCGAT CCGCAGGTGG ACTCGCTGCT GCGCGCTCGG
CGGCCCGAGC TTGGGCTTCC CGCCGAGATG GGGCGCAGCG TTGGGCAGGG CCTTTCTGAC
TACCTCGGCA TCGGGCGCGG CCGAGGTCTT GGCTTGTAG
 
Protein sequence
MAIYHFSVKV ISRATGASAV ASAAYRSASR LHDERLDRDH DFTNKSGVVH SEVMLPDGAP 
EHLSDRATLW NTVEAGEKRK DAQLSREVEF AIPREMDQAQ GIDLARDFVQ REMVDRGMVA
DLNVHWDIGA DGLAKPHAHV MLSMREVGED GFGAKVRDWN RTELVEHWRE AWADHVNERL
AQLDIDARID HRSLQDQGID LEPQHKIGPA AGRRADEGLE AERLDEHHEI ARANGERIIA
DPRIALDAIT KQQATFTRRD LAMFVHRHSD GKEQFDRAMG AVQASPELVA LGQDGRGQDR
FTSREMIAVE DRLHRASALM AERRAHRVSE LDQRRALTRA EQRGLVLSGE QKAAFEHVTA
TRDLGVVVGY AGTGKSALLG VAREAWEDAG YRVQGLALSG IAAENLEGGS GIASRTIASL
EHQWSQGREL LDTRDVLVID EAGMIGSRQM ERLLSAAEKA GAKVVLVGDP EQLQAIEAGA
AFRSIAERHN HVEITQVRRQ HEDWQRDATR HLATGRTGEA LVAYEARGMV HAAETRDQAR
EELVERWDRD RVAEPGKSRI ILTHTNDEVR DLNLTARERL RQAGALGQDV TVKAERGERA
LAAGDRLMFL KNDRGLGVKN GMLGEIEQVS PTQMTVRLDA GRSVAFDLKD YAQVDHGYAA
TIHKSQGVTV DRTHVLATPG MDRHGAYVAL SRHRDSVQLH YGRDEFEDLG KLTRVLSRER
AKDMASDFAE RRDIHVPPSM RPKEPQQRER GMFAGFRPVS PAHQPAAPAR DDDQLARRRS
IERHARAAAD VMRMNDRGLP VLAHQRRTLE QAREAMGKIG PHAAKDLERA YVRDPALAGE
TASGRTQRAI RVLQLEAEVR ADPRLRADRF VERWQGLERQ RTALHRAGDM TGAGQVKDRM
GAMAKSLERD PQVDSLLRAR RPELGLPAEM GRSVGQGLSD YLGIGRGRGL GL