Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_2196 |
Symbol | |
ID | 5899651 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 2388325 |
End bp | 2389305 |
Gene Length | 981 bp |
Protein Length | 326 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641562688 |
Product | transglutaminase domain-containing protein |
Protein accession | YP_001683822 |
Protein GI | 167646159 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1305] Transglutaminase-like enzymes, putative cysteine proteases |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 0.327421 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.29997 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCTGCTCG AGATCCGTCA TATCACCCAG TACCACTATG AGCGGCCGGT CCGCGAAAGC CTGATGGAGC TGTGGATGCA GCCCCAGAAG AGCGCGCGGC AACGACTGGT CAGCTTCGAG CTGGAGATCG AACCCGCCGC CCAAGTGTTC TCCTATGCCG ACAGCTTCGG CAACGCGGTC TATCACTTCG ACGTGCCCCA GCCGCACGAC GGCCTGACCA TCATCGCCCG TTCGGCGGTC GAGACCGAGC CGGTGCATGA GTGGCCGACG CCGCTGGACA TGGGCGAATG GGAACGGCTG AAGAGCGCCT TCGTGCGCGG CGAATGCTTC GACTACCTGC AACCCCACGG TTTCGTCAGG GTCACCTCGG CTCTGGAGAC CTTCATGGTC GAGCATGACA TCGACGACCT GCGGCGGCGG GATCCGTTCT CCGCGGTCCA GGCCCTGTGC GACACGATCT ACAAGGCCTT CGAATACCAG CCGGGCGTCA CAGACGCCGA TAGTCCGATC GATCTGGCGC TCAGCGCCGG CCGGGGCGTC TGCCAGGACT TCGCCCACAT CATGCTGGCC ATCTGCCGCG CCTGGGGCAT CCCGGCCCGC TACGTATCAG GTTATCTGTT CACCGACCGC GAGGCCGGTG ATCGCTCCGA TCCCGACGCC ACCCACGCCT GGGTCGAGGT TTTCCTGCCG AGCTACCGCT GGATCGGTTT TGATCCGACC AACAACGTCA TGACGGGCGA GCGGCACGTG GCCGTGGCGG TCGGGCGCGA CTATGGCGAC GTCACGCCGT CGCGCGGCGT CTACAAGGGC GATTCCGAGA GTCATCTGGC GGTCGGGGTC TCGGTGCGGC GGGCTCGGGC GGCCGTGGCC GAGCCGGAGT TCCTGCGCCT GCCGCGTCCC GCCGCGCCTT TCGGTCCGGG TCGCCGCCGC GCCAGCGCCG ACGCCATCCG CGAGCACCAG CAGCGCCAGC AACAGCAGTA G
|
Protein sequence | MLLEIRHITQ YHYERPVRES LMELWMQPQK SARQRLVSFE LEIEPAAQVF SYADSFGNAV YHFDVPQPHD GLTIIARSAV ETEPVHEWPT PLDMGEWERL KSAFVRGECF DYLQPHGFVR VTSALETFMV EHDIDDLRRR DPFSAVQALC DTIYKAFEYQ PGVTDADSPI DLALSAGRGV CQDFAHIMLA ICRAWGIPAR YVSGYLFTDR EAGDRSDPDA THAWVEVFLP SYRWIGFDPT NNVMTGERHV AVAVGRDYGD VTPSRGVYKG DSESHLAVGV SVRRARAAVA EPEFLRLPRP AAPFGPGRRR ASADAIREHQ QRQQQQ
|
| |