Gene Caul_2017 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_2017 
Symbol 
ID5899472 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp2160178 
End bp2162166 
Gene Length1989 bp 
Protein Length662 aa 
Translation table11 
GC content71% 
IMG OID641562506 
Producthypothetical protein 
Protein accessionYP_001683643 
Protein GI167645980 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3843] Type IV secretory pathway, VirD2 components (relaxase) 
TIGRFAM ID[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.985162 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTCAGG ACGACGATTT CGAACCCCGC CTGGGCAAGC TTCGGGCGAT CGGCGGCACG 
CCCAAGAGCT ATCTGTCGCG CGTGCTGCGT TCGGCCATGC TGGCCAGCGG CGGGACCTTC
GGCAGGGCGC CGCGGCGGAC GGGGTTCACC GGCGCGCGGA TCGGCAAGGG CGCGGGTGTG
GGGCGGGTGC TGGCGGCGCG CGATCACTTC GCCGCCTATC GATCGCGCCG GGTGATCGTC
AAGTTCTCGA TCCCCAAGCT GACGGGGAAG GGGATCGCCG CGGCCCGGGC CCACATGCGC
TACGTCCAGC GCGACGGCGT CACCCGCGAG GGCGCGCCTG GCCAGCTCTA TGGGGCCGAG
ACCGACAAGG CCGACGGGCG GGCCTTCATC GACCGCGCCG AAGCCGGCGA CGACGCCCGG
CAGTTTCGGT TCATCGTCTC GGCCGAGGAC GGCGCGGAGT ACGAGGATCT CAAGCCCCTG
ACCCGCCGGC TGATGGCGCA GATGGAACAG GATCTCGACA CCAAGCTCGA CTGGGTGGCG
GTCGATCACC ACAACACCGG CCATCCGCAC ACCCACATCA TCGTGCGCGG GCGCAATGAC
CGGGGCGCTG ATCTCGTCAT CGCCAAGGAA TACCTGACCC AGGGCCTGCG CGAGCGCGCC
GCCGAGCTGG TGTCGCTTGA CCTGGGACCG CGCGGCGATC TGGAGATCGA GGACCGGCTT
CGTCGCGAGG TTGGTCAGGA GCGGCTGACC AGCCTGGACC GGGGATTGCT GCGCGACGTC
GCCGAGGACG GGCTGGTCTC GCCGCGTCAT GTCGATCCGC GCCTCTATGC CTTGCGCGCC
GGACGTCTGC AGACCCTGGC GCGGTTGGGT CTGGCCCAGG AGGAGGCGCC CACGCGGTGG
CGGCTGGCCG AGGGCCTGGA AGACACCCTG CGGCGGATGG GCGAACGCGG CGACATCCAG
AAGACCCTCT ATCGGGCCCT GACGCAGGGC GGGATCGACC GCGACCGCGC CGACCAAGTG
ATCTATCAGC CGGCGGCGGC CGACGCCCGT CCGCTCGTCG GCCGCCTGGT GGCGCGAGGC
CTGTCGGATG AACTGAAGGA TCGCCACTAC CTGGTCGTCG ACGGCGTCGA TGGCCGCGCC
CACTATGTCG AGATCGGGCG CGGGCTGGCG ACCGACCCAA TCCCCGAGGG CGCGGTGATC
CGGGTCGAGC CGCGCTCGAC GCAAGCCCGG GCCGTCGATC ACACCGTCGC CGCGATCGCC
CAGGCCAATG GTGGGCGCTA CAGTGTCGAT CTGCACCTGG CCACCGATCC GACGGCCTCG
GCCGGCTTCG CCGAGGCCCA CATCCGACGC CTGGAAGCGA TGCGGCGCGG CCGCGCGTCT
GTCGAGCGCC AGGAGGACGG GAGCTGGATC ATCGGCGCCG ACCATATCGA GCAGGCTGCC
GCCTTCGAAA AGCGGCAGGC GGAGTCGGCG CCGGTGGTGG TCACGACCCT CAGCGCCAGG
CCGCTTGACC AGCAGGTCGG CGTGGCCGGC GCGACCTGGC TTGATCGCGA GCTGATCGCT
GAGGCGCCCG AACCTCTGCG CGACGCGGGC TTTGGCCGGC AGGCCCGTCA GGCCTTGGCC
TTGCGACGCC AGTGGCTGAT CGACCAGGGC CTGGCCCGCC AGGAACAGGA CCAGGTGATC
TATCGCGCTG GTCTGCTGAC CCGCCTGCAG CGGCGCGACC TGATCGTCGC CGCCGAGGGG
TTGGGGCGCG AGACGGGGCT GAACTTCTCC GAAGCGCGCC CCGGCCAACG GATCGAGGGC
GTCTACCGGC GGTCGGTGGA CCTGGCCAGC GGCCGGTTCG CGGTGATCGA GCGCAGCCGC
GATTTCACGC TGGTTCCATG GAAGCGGGCG CTGGAGGGAC AGGAGGGGCG CGCTGTGTCT
GGCGTGCTGC GCGAGGCAGG CGTCAGTTGG ACGATCGGGC GCGGGCGGGG CGGACCCTCG
ATCTCCTAG
 
Protein sequence
MAQDDDFEPR LGKLRAIGGT PKSYLSRVLR SAMLASGGTF GRAPRRTGFT GARIGKGAGV 
GRVLAARDHF AAYRSRRVIV KFSIPKLTGK GIAAARAHMR YVQRDGVTRE GAPGQLYGAE
TDKADGRAFI DRAEAGDDAR QFRFIVSAED GAEYEDLKPL TRRLMAQMEQ DLDTKLDWVA
VDHHNTGHPH THIIVRGRND RGADLVIAKE YLTQGLRERA AELVSLDLGP RGDLEIEDRL
RREVGQERLT SLDRGLLRDV AEDGLVSPRH VDPRLYALRA GRLQTLARLG LAQEEAPTRW
RLAEGLEDTL RRMGERGDIQ KTLYRALTQG GIDRDRADQV IYQPAAADAR PLVGRLVARG
LSDELKDRHY LVVDGVDGRA HYVEIGRGLA TDPIPEGAVI RVEPRSTQAR AVDHTVAAIA
QANGGRYSVD LHLATDPTAS AGFAEAHIRR LEAMRRGRAS VERQEDGSWI IGADHIEQAA
AFEKRQAESA PVVVTTLSAR PLDQQVGVAG ATWLDRELIA EAPEPLRDAG FGRQARQALA
LRRQWLIDQG LARQEQDQVI YRAGLLTRLQ RRDLIVAAEG LGRETGLNFS EARPGQRIEG
VYRRSVDLAS GRFAVIERSR DFTLVPWKRA LEGQEGRAVS GVLREAGVSW TIGRGRGGPS
IS