Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_2017 |
Symbol | |
ID | 5899472 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 2160178 |
End bp | 2162166 |
Gene Length | 1989 bp |
Protein Length | 662 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641562506 |
Product | hypothetical protein |
Protein accession | YP_001683643 |
Protein GI | 167645980 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3843] Type IV secretory pathway, VirD2 components (relaxase) |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 0.985162 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTCAGG ACGACGATTT CGAACCCCGC CTGGGCAAGC TTCGGGCGAT CGGCGGCACG CCCAAGAGCT ATCTGTCGCG CGTGCTGCGT TCGGCCATGC TGGCCAGCGG CGGGACCTTC GGCAGGGCGC CGCGGCGGAC GGGGTTCACC GGCGCGCGGA TCGGCAAGGG CGCGGGTGTG GGGCGGGTGC TGGCGGCGCG CGATCACTTC GCCGCCTATC GATCGCGCCG GGTGATCGTC AAGTTCTCGA TCCCCAAGCT GACGGGGAAG GGGATCGCCG CGGCCCGGGC CCACATGCGC TACGTCCAGC GCGACGGCGT CACCCGCGAG GGCGCGCCTG GCCAGCTCTA TGGGGCCGAG ACCGACAAGG CCGACGGGCG GGCCTTCATC GACCGCGCCG AAGCCGGCGA CGACGCCCGG CAGTTTCGGT TCATCGTCTC GGCCGAGGAC GGCGCGGAGT ACGAGGATCT CAAGCCCCTG ACCCGCCGGC TGATGGCGCA GATGGAACAG GATCTCGACA CCAAGCTCGA CTGGGTGGCG GTCGATCACC ACAACACCGG CCATCCGCAC ACCCACATCA TCGTGCGCGG GCGCAATGAC CGGGGCGCTG ATCTCGTCAT CGCCAAGGAA TACCTGACCC AGGGCCTGCG CGAGCGCGCC GCCGAGCTGG TGTCGCTTGA CCTGGGACCG CGCGGCGATC TGGAGATCGA GGACCGGCTT CGTCGCGAGG TTGGTCAGGA GCGGCTGACC AGCCTGGACC GGGGATTGCT GCGCGACGTC GCCGAGGACG GGCTGGTCTC GCCGCGTCAT GTCGATCCGC GCCTCTATGC CTTGCGCGCC GGACGTCTGC AGACCCTGGC GCGGTTGGGT CTGGCCCAGG AGGAGGCGCC CACGCGGTGG CGGCTGGCCG AGGGCCTGGA AGACACCCTG CGGCGGATGG GCGAACGCGG CGACATCCAG AAGACCCTCT ATCGGGCCCT GACGCAGGGC GGGATCGACC GCGACCGCGC CGACCAAGTG ATCTATCAGC CGGCGGCGGC CGACGCCCGT CCGCTCGTCG GCCGCCTGGT GGCGCGAGGC CTGTCGGATG AACTGAAGGA TCGCCACTAC CTGGTCGTCG ACGGCGTCGA TGGCCGCGCC CACTATGTCG AGATCGGGCG CGGGCTGGCG ACCGACCCAA TCCCCGAGGG CGCGGTGATC CGGGTCGAGC CGCGCTCGAC GCAAGCCCGG GCCGTCGATC ACACCGTCGC CGCGATCGCC CAGGCCAATG GTGGGCGCTA CAGTGTCGAT CTGCACCTGG CCACCGATCC GACGGCCTCG GCCGGCTTCG CCGAGGCCCA CATCCGACGC CTGGAAGCGA TGCGGCGCGG CCGCGCGTCT GTCGAGCGCC AGGAGGACGG GAGCTGGATC ATCGGCGCCG ACCATATCGA GCAGGCTGCC GCCTTCGAAA AGCGGCAGGC GGAGTCGGCG CCGGTGGTGG TCACGACCCT CAGCGCCAGG CCGCTTGACC AGCAGGTCGG CGTGGCCGGC GCGACCTGGC TTGATCGCGA GCTGATCGCT GAGGCGCCCG AACCTCTGCG CGACGCGGGC TTTGGCCGGC AGGCCCGTCA GGCCTTGGCC TTGCGACGCC AGTGGCTGAT CGACCAGGGC CTGGCCCGCC AGGAACAGGA CCAGGTGATC TATCGCGCTG GTCTGCTGAC CCGCCTGCAG CGGCGCGACC TGATCGTCGC CGCCGAGGGG TTGGGGCGCG AGACGGGGCT GAACTTCTCC GAAGCGCGCC CCGGCCAACG GATCGAGGGC GTCTACCGGC GGTCGGTGGA CCTGGCCAGC GGCCGGTTCG CGGTGATCGA GCGCAGCCGC GATTTCACGC TGGTTCCATG GAAGCGGGCG CTGGAGGGAC AGGAGGGGCG CGCTGTGTCT GGCGTGCTGC GCGAGGCAGG CGTCAGTTGG ACGATCGGGC GCGGGCGGGG CGGACCCTCG ATCTCCTAG
|
Protein sequence | MAQDDDFEPR LGKLRAIGGT PKSYLSRVLR SAMLASGGTF GRAPRRTGFT GARIGKGAGV GRVLAARDHF AAYRSRRVIV KFSIPKLTGK GIAAARAHMR YVQRDGVTRE GAPGQLYGAE TDKADGRAFI DRAEAGDDAR QFRFIVSAED GAEYEDLKPL TRRLMAQMEQ DLDTKLDWVA VDHHNTGHPH THIIVRGRND RGADLVIAKE YLTQGLRERA AELVSLDLGP RGDLEIEDRL RREVGQERLT SLDRGLLRDV AEDGLVSPRH VDPRLYALRA GRLQTLARLG LAQEEAPTRW RLAEGLEDTL RRMGERGDIQ KTLYRALTQG GIDRDRADQV IYQPAAADAR PLVGRLVARG LSDELKDRHY LVVDGVDGRA HYVEIGRGLA TDPIPEGAVI RVEPRSTQAR AVDHTVAAIA QANGGRYSVD LHLATDPTAS AGFAEAHIRR LEAMRRGRAS VERQEDGSWI IGADHIEQAA AFEKRQAESA PVVVTTLSAR PLDQQVGVAG ATWLDRELIA EAPEPLRDAG FGRQARQALA LRRQWLIDQG LARQEQDQVI YRAGLLTRLQ RRDLIVAAEG LGRETGLNFS EARPGQRIEG VYRRSVDLAS GRFAVIERSR DFTLVPWKRA LEGQEGRAVS GVLREAGVSW TIGRGRGGPS IS
|
| |