Gene Caul_2635 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_2635 
Symbol 
ID5900090 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp2856968 
End bp2857903 
Gene Length936 bp 
Protein Length311 aa 
Translation table11 
GC content66% 
IMG OID641563126 
Producttransglutaminase domain-containing protein 
Protein accessionYP_001684260 
Protein GI167646597 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1305] Transglutaminase-like enzymes, putative cysteine proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.000599705 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCGTAC TTTCGGTTCG CCATCTGACG CGTTACCAGT ACCGCAATCC GGTGGCGTTT 
GGCGATCATC GCATGATGTT TCGCCCGCGC GAAGGTTGCG ACCAGCGCGT GCTGTCGTCG
AGCCTGGTCA TCAGCCCGAC GCCGACCCAA ATGCGCTACG TGCAGGATGT GTTCGGCAAC
TGGGTCGGGA TCGCCCGCTT CCATGGCCGC GCCAGCGAGC TGAGCTTCGA GAGCCAGATC
GTCCTGGACC ATACGCCCCT GCCAGCCTTC GGCGACCAGC CGGGCGACAT CGACGTCCAT
GCGCACGGCC AGCCGATCGC CTATGCGAGC GAAGACCTGC CCGACCTTAT GCAGTCGATC
GAGCTCCAGC ATGACGATCC GGGCGGCGTG GTGGCCGCGT GGGCCCAGCG CTTCGTCAGA
CGGAGCGGCT CGACCAGCCT GCAACACCTG TTGGCCGAGA TGACCCAGGC GATCTATGCC
GACCTTAAGT ACGGCAAGCG CCTTGAACAT GGCGTGCAGA CCCCGCTGAC CACGCTCGAG
ACCAAGAGCG GCACCTGTCG CGATTTCGCC GTGCTGATGA TCGAGGCCGT TCGCAGCCTG
GGCCTGGCCG CCCAGTTCGT GTCCGGCTAC ATCCACAGCC CGTCGCGCAA GCTCGAACCA
GAGGTTCGCA CCGGCGGCGG CCACACCCAT GCCTGGCTGC GTGTCTATCT GCCCTCGTGC
GGCTGGGTCG AGTTCGATCC GACCAACGGC ATCGTCGGCA ACACCGATCT GATCCGTGTC
GCCGTGGTGC GCGATCCACG CCAAGCCGCG CCCCTCCATG GCACATGGGC GGGCTTTCCG
ACCGACTATC TGGGCATGGA AGTCGAAGTC GACGTGACCA GTCTTGAGGC GGACTACGCT
CACCTCGGGG CGCCCCTGGC CATGGCCGTC GGCTAG
 
Protein sequence
MPVLSVRHLT RYQYRNPVAF GDHRMMFRPR EGCDQRVLSS SLVISPTPTQ MRYVQDVFGN 
WVGIARFHGR ASELSFESQI VLDHTPLPAF GDQPGDIDVH AHGQPIAYAS EDLPDLMQSI
ELQHDDPGGV VAAWAQRFVR RSGSTSLQHL LAEMTQAIYA DLKYGKRLEH GVQTPLTTLE
TKSGTCRDFA VLMIEAVRSL GLAAQFVSGY IHSPSRKLEP EVRTGGGHTH AWLRVYLPSC
GWVEFDPTNG IVGNTDLIRV AVVRDPRQAA PLHGTWAGFP TDYLGMEVEV DVTSLEADYA
HLGAPLAMAV G