Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_1437 |
Symbol | |
ID | 5898892 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 1524307 |
End bp | 1527186 |
Gene Length | 2880 bp |
Protein Length | 959 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641561924 |
Product | tetratricopeptide TPR_4 |
Protein accession | YP_001683065 |
Protein GI | 167645402 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGCTGG GGGCGACTCT GAGATCGACC GTCGCGGCGA TCTGCATCCT CGGGGTCCTG ACCCCGTCGG TGTCGATCTC GGCCGCGCCG AGTCACGTCG CCAAGCCCGC CCCCCGCAAG GCCGCGCCCC ATCCGGCGGC CTCGGGCGGA CTGCAGGTGC GGGTCGCCCA GGCGCCCGAC TTCACGCGAC TGGAGTTCCC CGGCTCGCGC GGCATGCAGG CCCATCGTGA GGGGACGGCC GTGGTGCTGC GCTTCCCGCG CGACGCCAAT CCCAACCTCT CCAGCCTCAA TGTTGGTCCG CCTCGCTGGG TGAAGGGCGC CCAGGCCCGC CATGTCGGCG GCGTGGTCGA GATCGTCGTC GCCCTGAGCG ACGACGCCGA CTTCCGCGCC GGCAACGCCG ATGGCGACGA CTTCATCAAC CTGTTCGCGC GCGCCGAGGC CCCCGCCGCG CCGGCCCCGA CCGCCCCCGT GGTGGCGCGC CGTCCCAATC CGATGCCCGC CAGCGGCGTG GTGCCGGTGA AGTTCGCCAA GGTCGGCGGC CAGGTTCAGC TGAGCTTCGA CTGGGCCAAT CCGGCCGGCG CCGCGGTGTT CCGTCGCGGC GAGGCCGTGT GGGTGGTGTT CGACGCCCCG GCCCGGCTGG ACGTCGGCAG ATTGCCGGCC AGCACGCCGC AATACCGCAA CATCCAGGTC TTCAAGGGGT CCGACTACGC CGCGTTGCGC ATCAGCACGC CGCAAGGCGC GTCGTTCCAC GCCGACGGCG TGGGCGCCGG CTGGACGGTC TCTCTGGGCG CGGGCCTTCA GGACGCGCCC GACATGATCC GCGTGGCCCG CGACGACAAC GCCAGCCCCG CGACCCTGAC CGCCACGGTG GCTGGCGCGA CCAAGGTGAT CTGGGCGGCC GACCCCGCGG TCGGCGACCG CCTCGCCGTC GTCACCGCCC TGGCGCCGGC CAAGGGGCTG CCGTCGCGTC GCGACTATGT CGAGATGGCG CTGCTGCAAT CGGCCACCGG CCTGGCGGTC GAAAGCTATT CCGGCGACCT GCTGATGACC GCCGAAGGCG ACATTGTCAG CATCGGTCGG CCCAAGGGCC TGCTGTTGTC CTCGGCCGTC AGCAGCCACG TGCGCGGCGA CGGCCCGGTC GGCGCGCCCC AGGCCATGGC CATGCCGGCC CTGATCGACC CCGAGCATTG GTCCAAGACC GGGTCCGGCG GATTCATGGC CCGCTACGAC GCCCTGCTCA GCGCCATTCC GGCCGCCAGC GGCGGCGACG GCAAGGGCGA CGACACGGCC GCCCGCATGG CCCTGGCCCG CTTCCTGGTC GGTTCGAACC TGTCCTACGA GGCGATCGGC GTGCTCAACG CCGCCGGCCG CGCCCACCAG ACCCTGATGG GGGACGCCGA GTTCCGCGGT CTGCGTGGCG TGGCCAAGGT GCTGGCGCGC CGCTACGCCG AGGCGGAGGC CGACTTCGCC TCGCCGGTCC TGGCCGACGA TCCTTCAGCG GCCCTGTGGC GCGGCTACGT CGCCAGCAAG TCAGGCCAGT GGAGCGACGC CAGCCAGCGT TTCGCCGAGG GCGCCCGCGC CCTGAACATG TTCCCGGCGC TTTGGCAGGC TCGTTTCCTG GGCGCCCAGG CCGAGGCGGC GGTCGGTGGC GGTGCGGCGG AGGCCGCCAT CCGGCTGGTC AACCAAGCCC TCAAGCTGCC CAACATCCCC GTCGAGGACC AACTGGAGCT GCGCCTGACC CTGGCCAAGG CGTTCGAGCT CAAGGGCGAC TCGCGCGCCC TGCCGGTGTT CCAGGCCATC GCCCGCAGCG GCATCGACCG GCTGGCGGCG CCAGCCACCC TGCACGCCAT CCAGATCCGC CTGGCCAAGA ACCAGGTCAG CGCGGACGTC GCCGCCAAGG GCCTGGACCA ACTTCGCTTC CGCTGGCGCG GCGACGGCGT CGAACTCGAC ACCATCCGCG CCCTGGGCCA ACTGCAGATC AAGCAGGGCC GCTATCGCGA GGCGCTAGAG GTGCTGCATT CGGCCGGCAA CGCCCAGCCT GATCTGCCGC AGGCGGTCGC CCTGCGCAAC GACCTTAACA CCGCCTTCCG GGCGCTGTTC CTTGACGGTC TGGCCGACGG CATGCAGCCG ATCCAGGCGG TGGGCCTGTT CTACGACTTC CAGGACCTGA CCCCGATCGG CGCCGACGGC GACCAGATGG TCCGCAACCT GGTGCGTCGT CTGGTCGACG TCGATCTGCT GGACGAGGCC GCCAAGCTGC TGAAGTACCA GGTCGACAAT CGCCTGAACG GCGTACCCAA GGCCCAGGTG GCCACCGACC TGGCCTGGAT CTACCTGATG GACCGCAAGC CCGAGAGCGC GCTGGACGCC ATCAACGCCA CCCGCACCAC GGTGCTGCCG CCAGCCCTCA ACGCCGAACG TCGCCTGGCC ACCGCCCGCG CCCTGATGGG CCTGGGCCGC TATGACGCCG CACTGGAAGT GGTCGAGAGC GACAAGGGGC GGGACGTCGA GGACGTCCGC GCCGACATCG CCTGGAAGCA GCACGCCTGG CCCCTGGCCG GCGCCGTCTA CGAACGCGCG CTGGGCGACC GCTGGAAGAC GCCCGGTATC CTGACGCCCG CCGACGAGTC CAAGCTGCTG CGTTCGGCCG TGGCCTTCAG CCTGGCCGAC GACGACGCCG CGCTGGGTCG ATTGCGCCAG CGTTACGGCG GCTTCGTCGA CGGCGCCCGT AATCCTGACG GCCTGCGCGT GGCCCTGGCC GGCGCCGACC CGGGCAGCCT GTCCAGCAGC GACTTCAGCA AGGTCACCGC CGACAACGAG GCCTTCAACG GCTGGGTCGC CAAGATGAAG GACCGGTTCC GGGCCCAGGC GAGCAACTAG
|
Protein sequence | MRLGATLRST VAAICILGVL TPSVSISAAP SHVAKPAPRK AAPHPAASGG LQVRVAQAPD FTRLEFPGSR GMQAHREGTA VVLRFPRDAN PNLSSLNVGP PRWVKGAQAR HVGGVVEIVV ALSDDADFRA GNADGDDFIN LFARAEAPAA PAPTAPVVAR RPNPMPASGV VPVKFAKVGG QVQLSFDWAN PAGAAVFRRG EAVWVVFDAP ARLDVGRLPA STPQYRNIQV FKGSDYAALR ISTPQGASFH ADGVGAGWTV SLGAGLQDAP DMIRVARDDN ASPATLTATV AGATKVIWAA DPAVGDRLAV VTALAPAKGL PSRRDYVEMA LLQSATGLAV ESYSGDLLMT AEGDIVSIGR PKGLLLSSAV SSHVRGDGPV GAPQAMAMPA LIDPEHWSKT GSGGFMARYD ALLSAIPAAS GGDGKGDDTA ARMALARFLV GSNLSYEAIG VLNAAGRAHQ TLMGDAEFRG LRGVAKVLAR RYAEAEADFA SPVLADDPSA ALWRGYVASK SGQWSDASQR FAEGARALNM FPALWQARFL GAQAEAAVGG GAAEAAIRLV NQALKLPNIP VEDQLELRLT LAKAFELKGD SRALPVFQAI ARSGIDRLAA PATLHAIQIR LAKNQVSADV AAKGLDQLRF RWRGDGVELD TIRALGQLQI KQGRYREALE VLHSAGNAQP DLPQAVALRN DLNTAFRALF LDGLADGMQP IQAVGLFYDF QDLTPIGADG DQMVRNLVRR LVDVDLLDEA AKLLKYQVDN RLNGVPKAQV ATDLAWIYLM DRKPESALDA INATRTTVLP PALNAERRLA TARALMGLGR YDAALEVVES DKGRDVEDVR ADIAWKQHAW PLAGAVYERA LGDRWKTPGI LTPADESKLL RSAVAFSLAD DDAALGRLRQ RYGGFVDGAR NPDGLRVALA GADPGSLSSS DFSKVTADNE AFNGWVAKMK DRFRAQASN
|
| |