Gene Caul_3498 details

Gene Information

Locus tag: Caul_3498
Symbol:
ID: 5900953
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 3773302
End bp: 3776184
Gene Length: 2883 bp
Protein Length: 960 aa
Translation table: 11
GC content: 58%
IMG OID: 641564004
Product: Phage-related protein tail component-like protein
Protein accession: YP_001685123
Protein GI: 167647460
COG category: [S] Function unknown
COG ID: [COG4733] Phage-related protein, tail component
TIGRFAM ID:
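
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of the source database.

```python
# Values copied from the Gene Information fields above.
START_BP = 3773302
END_BP = 3776184
GENE_LENGTH_BP = 2883
PROTEIN_LENGTH_AA = 960


def span_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one, assuming an unspliced CDS with one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 3776184 - 3773302 + 1 gives 2883 bp, and 2883 / 3 - 1 gives 960 aa, matching the reported values.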


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGGTCAAAG TCATCATCAA GGGACCGCTA GCTAAGAAGC TCCCAGCCGC TTATCGCAAG 
TCGATCAGCT TTAAGGTCTC AACGATCCGC GAACTCCTGT GGGGGCTCGA TCAGTTCGGT
TCGTTCTCCG CTGACCTAGA CGCTTTCCCG CACCAGTTCT TCGTCGGAAA GACGCTCAAG
ACCGCTCACA GGTTGTCAGC CCAAGAAGCT GCCGCTGTGA AGCTGGACGA GCAAGAAGGC
GTCTGCGTCT TCATCGTTCC ATGCGTCGCG GGTGCCGATC CCGTGACGAC AACGATGGTC
GTTACAGCGC TTGTCAGTGC TGCAATCTCC ATCGGTATCA GCCTCATCAT GGCGGTTCTG
TTTCCGCCGA CACAGGGAGG CAACGACACT CGAAAATCAG CCCTCTACGT CAACGGCATG
AACACTCAAC AGGAAGGTAT TCCGATCCCT GTTGCGTATG GCACTGACAT CTTCTGCGGC
TTCAACGTCA TCGAGGCCGA TGTCGAAGTC CTGAACACGG GCGGCATTTC GGGTTGGATG
AACCCGCTAT CGGGTGCGCT CGCGGTCAAT GCTTCAGCGG GAACCGGCCT CTACGGATCG
GGCGCTAACA ACTACGTCGA TCCCGCATGG GACATCCTGA AGGGCCTCGG CCTGAACGTG
GATGGCTTGG GCGCAAAGGG CGGTGGCAAG ACCATCGCGA ACAACACCTT CTCCAACGCC
TCCATGAAAA TCCTAGGCGC AGTCAGTGCG GGTCCAATCG GCGGATTGGT CGGTAACACG
GTCGAAGAGC AGGAAAGCTC CATCTTCATC GACAAGACCG CCCTGCGTGA CGCAGCGAGC
GGCAAGCTCT CCAAGCAGGG CTTCACATGG GCGACACGTC CAGGCGAGAC CGGCCAATCT
GCCGTGCCTA TCACAGAGGC CATCGCAACG CCTTACGACG CTAGCCAGGA GCTTAAGAAG
CTCACCAGTG CAGGCATCCC AACCGACATT CAGAAGACGT TCCCGAACGA CATCAACCGC
TCGCGCTTCC GCTTCAACAT GGTCCTTCTT CAGACCAGCA AGAAGGGCAA CCAGAGCAAC
GCGACTGTCG TGCTTGGTGC AGACTTCAAG CGCATCACAG ACACTTCATG GACTCCTTAT
GCGAACTGGA CCTTCAACGG CAAAACCAGC ACGGGTGCCC AGCGCGAGAT TCAGGTGATT
GCTCCGAATT GGAAGACCGA TGAGGAGTGG CAAGTTCGCA TCTATCGCGT CACCGAGGAC
AGCACCGACG ACAAAATCCA GAACGCAACG ACGTTCAACG GCTGGGTCGA GATCATCGAC
AAAGACCTAG CCTATGACGG TACGGCGGAC TCACCGCCTA CAGCGCTATT CGGTGCAAGC
ATCGATGTCA GCCAGTTCGA CAGCGGCTCC AAGCCCGAGA TCATCCTACG CAGCGCAGGC
CGAAAGGTCC GCGTTCCTCA TGTCGTTGGC TCCTATTGGG ATGGATCATG GGACACCAAA
GTCACAGCCA ATCCGGTCTG GATTTGGTTC GACATGGCGA CAGACAAGCT CGTCGGCGGT
GGCCTCTCCG ACACATGGTT CAACCGCTTT GAACTCTACG AGATCGCGCA GTTCTGCGAC
CAACTCGTCA ATGGTCGTCC GCGCTTCACG CTGAACAAGC AATTCACCGA CAGCAAGGAA
CTCTGGCAGC AGCTACGCGA AGTCGCGCAG TCCTTCATGG CCGTGGCCTA TTGGAACGGT
TCGTCGGTGT CGCTTGTGCA GGACACGCCA AACGCGACGG TCAGCCACTA TATCACGAAC
ACGATGGTTG AAGATGGCGC GTTTGCGTAC TCGTTCACCG ATCACATCGA GCGCTTCAAC
GAAATCCTTG TCGAGTACGA CGACCCAACG CAGTTCGGCG CGAAGGGCAT TGCACCTTGG
CAGGATGACG CGGCGATCAC TCGCGCCCGC GCGCTCAACT TCCCCAACGA TGGGAAGATC
ACAAACACGA TCTACAAGAC CGGCTGCACC AACGCACAGG AGGCCTATGA CTGGGCTCGC
CTGCTTGGGT ATGCCTCGCA GCGGGAAGTT AGGAACGTCG CGTTCGCCGC GCCTATCGCG
GGCTCCACCT ACTTCCCTGG CCAGATCATC GAAATCGACG ACATGAACGT GTCGGGCAAA
GAGCCCGTGG GTCGTGTCGC ACGCATCATC GACGCTGACC ATATCGAGCT AGACGCTCCC
GTGACCTTGG AAGCGTTCAA GTCCTACACC CTACGCATCG TCGGTGAGAC CGTCCGTTCG
ATCTCGCTGC CAATGCTCAC CGTGACCACG AAGAGCGCCA CTATCAACGC TCCTGCTCAC
GGCGCGGTCG TTCAAGCTCC GGTCGGATTG ATCGAGCTTG GCACGGGTCC ACAGCCACAG
CGCTTCCGCA TAATCGAAGT GAGCGAAGCC GGTCCCGCGA AGTACGAGGT CAAGGCGCAG
CAGAGCATTA TCGGCAAGTG GGAAGAGGTC GAACAGCACG TCCCTGTTCC AATCCCAGAT
TGGACAAACC GTGACACGTC GGTTCGCGCT CCAACGAACA TCGTGTTCAC GCCGCACGCG
GCAGAAGACG ACATCACGGG CTCCTATACG TCACTCGAAA TCTCATGGAC CGGCCTGCCA
ACCGGCGCGC TCGTTCGCGA GTACGTCCTT GAAGCAACCA TGCCCGATGG CGGTGTTTCA
GTCGAACTCT ACAGAGGCGC AAACACGGGC TTTACGATCT CGCGTGCGCC TCCTGGCCTC
TACGTCGTGT CGGTCAAAAC GATAAATATG CTTGGTGGCA GCAGCGAGCC TCTGTCTGGA
AGCTACGTGC TTTCGACAGG CGATGAACTC GTCTTCCCAC CAGTCTTTAT AGGGTTTGAT
TGA
 
Protein sequence
MVKVIIKGPL AKKLPAAYRK SISFKVSTIR ELLWGLDQFG SFSADLDAFP HQFFVGKTLK 
TAHRLSAQEA AAVKLDEQEG VCVFIVPCVA GADPVTTTMV VTALVSAAIS IGISLIMAVL
FPPTQGGNDT RKSALYVNGM NTQQEGIPIP VAYGTDIFCG FNVIEADVEV LNTGGISGWM
NPLSGALAVN ASAGTGLYGS GANNYVDPAW DILKGLGLNV DGLGAKGGGK TIANNTFSNA
SMKILGAVSA GPIGGLVGNT VEEQESSIFI DKTALRDAAS GKLSKQGFTW ATRPGETGQS
AVPITEAIAT PYDASQELKK LTSAGIPTDI QKTFPNDINR SRFRFNMVLL QTSKKGNQSN
ATVVLGADFK RITDTSWTPY ANWTFNGKTS TGAQREIQVI APNWKTDEEW QVRIYRVTED
STDDKIQNAT TFNGWVEIID KDLAYDGTAD SPPTALFGAS IDVSQFDSGS KPEIILRSAG
RKVRVPHVVG SYWDGSWDTK VTANPVWIWF DMATDKLVGG GLSDTWFNRF ELYEIAQFCD
QLVNGRPRFT LNKQFTDSKE LWQQLREVAQ SFMAVAYWNG SSVSLVQDTP NATVSHYITN
TMVEDGAFAY SFTDHIERFN EILVEYDDPT QFGAKGIAPW QDDAAITRAR ALNFPNDGKI
TNTIYKTGCT NAQEAYDWAR LLGYASQREV RNVAFAAPIA GSTYFPGQII EIDDMNVSGK
EPVGRVARII DADHIELDAP VTLEAFKSYT LRIVGETVRS ISLPMLTVTT KSATINAPAH
GAVVQAPVGL IELGTGPQPQ RFRIIEVSEA GPAKYEVKAQ QSIIGKWEEV EQHVPVPIPD
WTNRDTSVRA PTNIVFTPHA AEDDITGSYT SLEISWTGLP TGALVREYVL EATMPDGGVS
VELYRGANTG FTISRAPPGL YVVSVKTINM LGGSSEPLSG SYVLSTGDEL VFPPVFIGFD
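
Both sequences above are printed in space-separated blocks of 10 residues, 60 per line, so the whitespace has to be stripped before they can be used. The following is a minimal sketch of that cleanup plus a GC-fraction calculation. The helper names are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (block spaces, line breaks) and upper-case the residues."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return gc_count / len(dna)


# First line of the gene sequence, copied as printed (including the trailing space).
first_line = "ATGGTCAAAG TCATCATCAA GGGACCGCTA GCTAAGAAGC TCCCAGCCGC TTATCGCAAG "

if __name__ == "__main__":
    dna = clean_sequence(first_line)
    print(len(dna))   # 60 bases on a full line
    print(dna[:3])    # ATG, the start codon
    print(round(gc_fraction(dna), 2))
```

Applied to the full gene sequence, `clean_sequence` should yield a string whose length can be compared with the reported 2883 bp, and `gc_fraction` a value to compare with the reported GC content of 58%.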