Gene Caul_2196 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_2196 
Symbol 
ID5899651 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp2388325 
End bp2389305 
Gene Length981 bp 
Protein Length326 aa 
Translation table11 
GC content67% 
IMG OID641562688 
Producttransglutaminase domain-containing protein 
Protein accessionYP_001683822 
Protein GI167646159 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1305] Transglutaminase-like enzymes, putative cysteine proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value0.327421 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.29997 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCTGCTCG AGATCCGTCA TATCACCCAG TACCACTATG AGCGGCCGGT CCGCGAAAGC 
CTGATGGAGC TGTGGATGCA GCCCCAGAAG AGCGCGCGGC AACGACTGGT CAGCTTCGAG
CTGGAGATCG AACCCGCCGC CCAAGTGTTC TCCTATGCCG ACAGCTTCGG CAACGCGGTC
TATCACTTCG ACGTGCCCCA GCCGCACGAC GGCCTGACCA TCATCGCCCG TTCGGCGGTC
GAGACCGAGC CGGTGCATGA GTGGCCGACG CCGCTGGACA TGGGCGAATG GGAACGGCTG
AAGAGCGCCT TCGTGCGCGG CGAATGCTTC GACTACCTGC AACCCCACGG TTTCGTCAGG
GTCACCTCGG CTCTGGAGAC CTTCATGGTC GAGCATGACA TCGACGACCT GCGGCGGCGG
GATCCGTTCT CCGCGGTCCA GGCCCTGTGC GACACGATCT ACAAGGCCTT CGAATACCAG
CCGGGCGTCA CAGACGCCGA TAGTCCGATC GATCTGGCGC TCAGCGCCGG CCGGGGCGTC
TGCCAGGACT TCGCCCACAT CATGCTGGCC ATCTGCCGCG CCTGGGGCAT CCCGGCCCGC
TACGTATCAG GTTATCTGTT CACCGACCGC GAGGCCGGTG ATCGCTCCGA TCCCGACGCC
ACCCACGCCT GGGTCGAGGT TTTCCTGCCG AGCTACCGCT GGATCGGTTT TGATCCGACC
AACAACGTCA TGACGGGCGA GCGGCACGTG GCCGTGGCGG TCGGGCGCGA CTATGGCGAC
GTCACGCCGT CGCGCGGCGT CTACAAGGGC GATTCCGAGA GTCATCTGGC GGTCGGGGTC
TCGGTGCGGC GGGCTCGGGC GGCCGTGGCC GAGCCGGAGT TCCTGCGCCT GCCGCGTCCC
GCCGCGCCTT TCGGTCCGGG TCGCCGCCGC GCCAGCGCCG ACGCCATCCG CGAGCACCAG
CAGCGCCAGC AACAGCAGTA G
 
Protein sequence
MLLEIRHITQ YHYERPVRES LMELWMQPQK SARQRLVSFE LEIEPAAQVF SYADSFGNAV 
YHFDVPQPHD GLTIIARSAV ETEPVHEWPT PLDMGEWERL KSAFVRGECF DYLQPHGFVR
VTSALETFMV EHDIDDLRRR DPFSAVQALC DTIYKAFEYQ PGVTDADSPI DLALSAGRGV
CQDFAHIMLA ICRAWGIPAR YVSGYLFTDR EAGDRSDPDA THAWVEVFLP SYRWIGFDPT
NNVMTGERHV AVAVGRDYGD VTPSRGVYKG DSESHLAVGV SVRRARAAVA EPEFLRLPRP
AAPFGPGRRR ASADAIREHQ QRQQQQ