Gene Caul_4368 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_4368 
Symbol 
ID5901829 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp4744378 
End bp4745430 
Gene Length1053 bp 
Protein Length350 aa 
Translation table11 
GC content74% 
IMG OID641564886 
ProductA/G-specific adenine glycosylase 
Protein accessionYP_001685986 
Protein GI167648323 
COG category[L] Replication, recombination and repair 
COG ID[COG1194] A/G-specific DNA glycosylase 
TIGRFAM ID[TIGR01084] A/G-specific adenine glycosylase 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.468133 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCGACA TCCCCGCCCT CCGCGCCGCC CTGCTGGCCT GGTACGACGC CCAGGCGCGG 
GACCTGCCCT GGCGGACCGG GCCGGCGGCC GGCAAGGCGG GACAGCGGTC CGACCCTTAC
CGCGTCTGGC TGTCGGAGGT GATGCTGCAG CAGACCACCG TGCCGCACGC CACGCCCTAT
TTCCTGAGTT TCACCCAGCG CTGGCCGACG GTCTCAAGCC TGGCGGCGGT GGCGGACGAC
GACCTGATGG CCGCCTGGGC GGGCCTGGGC TACTACGCCC GCGCCCGCAA CCTTCTGGCC
TGCGCCCGGG CCGTGGCGGC TGAGCACGGC GGGGTGTTTC CCGACACCGA GGCGGCCCTG
CGCGCCCTGC CGGGCGTCGG CGCCTACACC GCCGCCGCCG TTGCGGCCAT CGCCTTCGAC
CGCGAGGCCA ACGTGGTCGA CGGCAATGTC GAGCGGGTGA TGGCGCGGCT GTTCGCGGTG
GAAGACCCCG TGCCCGACGC CAAGCCGGAG CTGAAGCGCC TGGCCGGCGA GCTGGTCACC
GCCGCGCGTC CCGGCGACTG GGCCCAGGCG CTGATGGACC TGGGCGCGAC GGTGTGCCGG
CCCAAGGGTC CGCTGTGCGA CCGCTGCCCG GTCTCGGCCT GGTGCGAGGG CTTCAAGACC
GGCGCGCCGG AGACCTATCC GCGCAAGACG AAGAAGGCCG AACGGCCTCG CCGCTACGGG
GTGGCCTATG TCCTGACGCG GGGCGAGGCC ACGGCCCTGG TCCGCCGCCC GCCCAAGGGC
CTGCTGGGCG GGATGCTGGG CCTGCCGACC AGCGACTGGC GCGATCGTCC GTGGACGGAT
TTCGAAGCCG CCGCCACCGC GCCGGCGGCC GGCGCCTGGC GCGACTTCGG CGCGGTCGAG
CACGTCTTCA CCCACTTCTC GCTCACGCTG CGAGTGCTGC GGGCCGAGAG CAACGGCGAG
GGCGACTTCG TCTGGACCGA TCCAGCGGGG CTGGCCGCGC TGCCCAGCGT ATTTCTGAAG
GCCGCGAAGG CGGGGCGGGC GCGACTGGTC TAA
 
Protein sequence
MLDIPALRAA LLAWYDAQAR DLPWRTGPAA GKAGQRSDPY RVWLSEVMLQ QTTVPHATPY 
FLSFTQRWPT VSSLAAVADD DLMAAWAGLG YYARARNLLA CARAVAAEHG GVFPDTEAAL
RALPGVGAYT AAAVAAIAFD REANVVDGNV ERVMARLFAV EDPVPDAKPE LKRLAGELVT
AARPGDWAQA LMDLGATVCR PKGPLCDRCP VSAWCEGFKT GAPETYPRKT KKAERPRRYG
VAYVLTRGEA TALVRRPPKG LLGGMLGLPT SDWRDRPWTD FEAAATAPAA GAWRDFGAVE
HVFTHFSLTL RVLRAESNGE GDFVWTDPAG LAALPSVFLK AAKAGRARLV