Gene Caul_1662 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_1662 
Symbol 
ID5899117 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp1740701 
End bp1741720 
Gene Length1020 bp 
Protein Length339 aa 
Translation table11 
GC content68% 
IMG OID641562151 
Product2'-deoxycytidine 5'-triphosphate deaminase 
Protein accessionYP_001683289 
Protein GI167645626 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0717] Deoxycytidine deaminase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.170233 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.344824 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGACG CCCACGCAGC GGGGATTCTG CCCTGCCAAT CCATCGAGGA CCTGATCGCG 
CAAGAAGCGA TCACCTCCCA GACTCCGTTC GACACCGACC AGGTGCAGCC GGCCAGCCTC
GACCTGCGCC TGGGCGCGCG GTGCTGGCGG GTGCGGGCCT CGTTCCTGCC CGGCGTGGCC
CGCCGCGTGC CCGAGCGTCT GAAGGACGTG GCGATGCACG AGCTGGACCT GACCAAGGGC
GCGGTGCTGG AAAAGGGCTG CGTCTACATC GCCGAGATCC AGGAGCGCCT GGCCTTGCCG
GGCAATATCG CGGCGCGCGG CAATCCCAAG AGTTCGACGG GCCGGGTCGA TGTGTTCGTG
CGCCTGCTCA GCGACCACTC CAAGGCCTTC GACGACATCG AGGCCGGCTA TGAAGGGCCG
CTCTATATCG AGATCGCGCC GCAGACCTTC TCGGTGCTGG TGCGGTCCGG CACCCGGCTC
AATCAGCTGC GCCTGAAGCG CGGCGAGCCG GTCAAGCTGG CGACCCGCAG TGTCGGCGTC
GACCTGACCA CGGGCATCGT CGGCTTCCGG GGCCGTCGCC ACGCCGGGGT CATCGACCTG
GACCACGAGG ACGGCCACGA TCCGCGCGAC TTCTGGGAGC CGCTGGAGAC CCGTCACGGC
GAGCTGCTGC TCGATCCGGG CGAGTTCTAC ATCCTGGCCT CCAAGGACAA TATCGAGATC
CCGGTGCTGG AAGCGGCCGA GATGACGCCG ATCGATCCGT CGGTCGGCGA GTTCCGGGTC
CACTATGCCG GCTTCTTCGA CCCCGGCTTC GGGACAGAGG AGGCCGGCGC GGTCGGCTCC
AAGGGCGTGC TGGAGGTGCG CAGCCACGAG ACCCCGTTCC TGCTGGAGGA CGGTCAGATC
GTCGCCCGGC TGGTCTACGA GCCGCTGACC GCGCGGCCCG ACCGGCTCTA TGGCGAGGGC
GGTTCGCACT ATCAGCGCCA AGGTCTGAAA TTGTCCAAGC ATTTCAAGGC CTGGCGTTAG
 
Protein sequence
MTDAHAAGIL PCQSIEDLIA QEAITSQTPF DTDQVQPASL DLRLGARCWR VRASFLPGVA 
RRVPERLKDV AMHELDLTKG AVLEKGCVYI AEIQERLALP GNIAARGNPK SSTGRVDVFV
RLLSDHSKAF DDIEAGYEGP LYIEIAPQTF SVLVRSGTRL NQLRLKRGEP VKLATRSVGV
DLTTGIVGFR GRRHAGVIDL DHEDGHDPRD FWEPLETRHG ELLLDPGEFY ILASKDNIEI
PVLEAAEMTP IDPSVGEFRV HYAGFFDPGF GTEEAGAVGS KGVLEVRSHE TPFLLEDGQI
VARLVYEPLT ARPDRLYGEG GSHYQRQGLK LSKHFKAWR