Gene Caul_4623 details

Gene Information

Locus tag: Caul_4623
Symbol:
ID: 5902085
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 5001448
End bp: 5002839
Gene Length: 1392 bp
Protein Length: 463 aa
Translation table: 11
GC content: 64%
IMG OID: 641565142
Product: S-adenosyl-L-homocysteine hydrolase
Protein accession: YP_001686241
Protein GI: 167648578
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0499] S-adenosylhomocysteine hydrolase
TIGRFAM ID: [TIGR00936] adenosylhomocysteinase
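
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
start_bp = 5001448
end_bp = 5002839
gene_length_bp = 1392
protein_length_aa = 463


def span_length(start, end):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def residues_from_cds_length(length_bp):
    """Amino-acid count for a CDS, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


print(span_length(start_bp, end_bp))             # 1392
print(residues_from_cds_length(gene_length_bp))  # 463
```

Under these assumptions both computed values agree with the reported Gene Length and Protein Length.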


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGCCGACT ATATCGTCAA GGACATCTCG CTCGCCGATT TCGGCCGCAA GGAAATCGAC 
ATCGCCGAGA CCGAAATGCC CGGTCTGATG GCCACCCGCG CCGAATACGG CCCGGCCCAG
GTCCTGAAGG GCGCCCGCAT CGCCGGCAGC CTGCACATGA CCATCCAGAC CGCCGTGTTG
ATCGAGACGC TCACCGCCCT GGGCGCTGAA GTCCGCTGGG CCTCGTGCAA CATCTTCTCG
ACCCAGGACC ACGCCGCCGC CGCCATCGCC GCCTCGGGCG TGCCGGTGTT CGCCTTCAAG
GGCGAGAACC TGGTCGAGTA CTGGGAATAC GCCCACAAGA TCTTCGAATG GTCGGACGGC
GGCTATCCGA ATCTGATCCT CGACGACGGC GGCGACGCCA CCCTGCTTTG CGTGCTGGGT
CCCAAGGCCG AGAAGGATCC CTCGGTTCTC GACAACCCGC AGAACGAGGA AGAAGAAGCG
CTCTACACCG TGATGAAGCG CTACCTGGCC GAAAAGCCGG GCTTCTACAC CGCCATCCGC
GACGCCATCG GCGGCGTGTC GGAAGAAACC ACCACGGGCG TGCACCGCCT GTACCAGATG
GCCGCCAAGG GCGACCTGCC GTTCCCGGCC ATCAACGTCA ATGACAGCGT CACCAAGTCC
AAGTTCGACA ACCTGTATGG CTGCCGCGAA TCGCTGGTCG ACGCCATCCG GCGCGGCACC
GACGTCATGC TGTCGGGCAA GGTCGCGGTG GTCTGCGGCT ACGGCGACGT GGGCAAGGGC
TCGGCCGCCA GCCTGCGCAA CGGCGGCGCC CGGGTGATCG TCACGGAGAT CGACCCGATC
TGCGCCCTGC AGGCGGCGAT GGAAGGCTAT GAAGTCCAGA CCCTGGACGA CTGCGCCGGC
CGCGCCGACA TCTTCGTCAC CGCCACCGGC AACAAGGACG TCATCACCGT CGATCACATG
CGGCAGATGC GTAACAACGC CATCGTCTGC AACATCGGCC ACTTCGACTC GGAGATCCAG
GTCGCCGGCC TGAAGAACTT CAAGTGGGAC GAGATCAAGC CGCAGGTCCA CCACATCGAG
TTCCCGGACG GCAAGAAGAT CATCCTGTTG TCGGAAGGCC GCCTGGTGAA CCTGGGCAAC
GCCACGGGCC ACCCCTCGTT CGTGATGTCG GCCTCGTTCA CCAACCAGAC CCTGGCCCAG
ATCGAACTGT GGACCAACAA GACGGCCTAC AAGAACGACG TCTACACCCT GCCCAAGCAC
CTCGACGAAA AGGTCGCCCT GCTCCATCTG GAAAAGCTGG GCGCCAAGCT GTCGAAGCTG
CGTCCCGACC AGGCCGAGTA CATCAACGTG CCGGAAAACG GCCCGTTCAA GCCGGACCAC
TACCGCTACT AG
 
Protein sequence
MADYIVKDIS LADFGRKEID IAETEMPGLM ATRAEYGPAQ VLKGARIAGS LHMTIQTAVL 
IETLTALGAE VRWASCNIFS TQDHAAAAIA ASGVPVFAFK GENLVEYWEY AHKIFEWSDG
GYPNLILDDG GDATLLCVLG PKAEKDPSVL DNPQNEEEEA LYTVMKRYLA EKPGFYTAIR
DAIGGVSEET TTGVHRLYQM AAKGDLPFPA INVNDSVTKS KFDNLYGCRE SLVDAIRRGT
DVMLSGKVAV VCGYGDVGKG SAASLRNGGA RVIVTEIDPI CALQAAMEGY EVQTLDDCAG
RADIFVTATG NKDVITVDHM RQMRNNAIVC NIGHFDSEIQ VAGLKNFKWD EIKPQVHHIE
FPDGKKIILL SEGRLVNLGN ATGHPSFVMS ASFTNQTLAQ IELWTNKTAY KNDVYTLPKH
LDEKVALLHL EKLGAKLSKL RPDQAEYINV PENGPFKPDH YRY
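
The gene and protein sequences are related through translation table 11 (the bacterial, archaeal and plant plastid code). The gene starts with GTG, which codes for valine internally but is read as methionine when it serves as the start codon, which is why the protein begins with M. The sketch below is a plain-Python illustration rather than the database's own pipeline: the helper names `clean`, `translate_cds` and `gc_percent` are hypothetical, and the codon table is written out by hand in the usual TCAG order. It is applied only to the first 30 bases of the gene sequence; the full sequence can be pasted in the same way, since whitespace between the 10-base blocks is ignored.

```python
# Amino acids for translation table 11, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Codons that table 11 accepts as translation starts (read as Met).
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq):
    """Remove the spaces and line breaks used to block the sequence."""
    return "".join(seq.split()).upper()


def translate_cds(seq):
    """Translate a complete CDS, stopping at the first stop codon."""
    dna = clean(seq)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(dna), 3):
        codon = dna[pos:pos + 3]
        residue = CODON_TABLE[codon]
        if pos == 0 and codon in START_CODONS:
            residue = "M"  # alternative start codons are read as Met
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(seq):
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence above.
first_bases = "GTGGCCGACT ATATCGTCAA GGACATCTCG"
print(translate_cds(first_bases))  # MADYIVKDIS
```

The printed residues match the first block of the protein sequence above. `gc_percent` can be run on the full gene sequence to compare against the reported GC content.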