Gene Caul_4907 details


Gene Information

Locus tag: Caul_4907
Symbol:
ID: 5902369
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 5299992
End bp: 5301431
Gene Length: 1440 bp
Protein Length: 479 aa
Translation table: 11
GC content: 67%
IMG OID: 641565427
Product: pyridoxal-5'-phosphate-dependent protein beta subunit
Protein accession: YP_001686525
Protein GI: 167648862
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0031] Cysteine synthase
TIGRFAM ID: [TIGR01137] cystathionine beta-synthase
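
The start, end, gene length and protein length fields above are related by simple arithmetic. The sketch below checks that relationship, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and exist only for this illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Values copied from the record above.
length = gene_length_bp(5299992, 5301431)
print(length)                     # 1440
print(protein_length_aa(length))  # 479
```

Both results agree with the Gene Length and Protein Length fields in the record.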


Plasmid Coverage information

Num covering plasmid clones: 38
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACGACC CCGTTTTCGC CCTTCCGCCG GTCGCGAATT CCGCACTGGA CCTGATCGGC 
CACACGCCGA TGATGGAGGT CCGCAACCTC GACACCGGTC CATGCCGGCT GTTCCTCAAG
CTCGAGAACC AGAACCCCGG CGGCTCGATC AAGGACCGCG TGGCCCGGTC GATGATCGAG
GCGGCCGAGG CCGACGGCAG CCTCAAGCCC GGCGGGACGA TCATCGAGGC CACGGCCGGC
AATACGGGCC TGGGCCTGGC CCAGGTAGCG ACGCTGAAGG GCTACAAGCT GATCCTGATC
GTGCCGGACA AGATGGCTCG GGAGAAGATC TTGCACCTGC GGGCCATGGG CGTGGACGTC
CGCCTGACCC GCAGCGACGT CGGCAAGGGC CACCCGGAAT ACTACCAGGA CATGGCCCAG
ACCCTGGCCC AGTCGATCCC GGGCGCGATC TATGTCAATC AGTTCGAGAA CCCCGCCAAC
CCGCTGGCCC ACGAGACGAC CACCGCGCCC GAGATCTTCG AGCAGATGGG CGGCGACATC
GACGCGATGG TGGTCGGCGT CGGCTCGGGC GGCACCCTGA CGGGTGTCGG CCGGTTCATG
GCCAAGCATT CGCCCAAGAC GGAAATGGTG CTGGCCGACC CGGTCGGCTC GATCCTGTGC
GACTACGTGG CCACGGGGAC CTATGGCGAA GCCGGCTCGT GGATCGTCGA GGGCATCGGC
GAGGACTTCA TCCCGGTCAA CGCCGAGATG GACTTCGTCA AGCACGCCTA TTCGATCAGC
GACCGCGAGA GCGTCGACAC CGCCCGGCTG CTGCTGCGCA AGGAGGGCAT CCTGGCCGGC
TCGTCGTCGG GCACCCTGCT GGCGGCCGCC CTGCGCTACT GTCGCGAGCA GACCGAGCCG
AAGCGGGTGG TGACCCTGGT CTGCGACACG GGCTCCAAGT ACCTGACCAA GATGTTCAAC
GACATGTGGC TGGCCGCCCA TGGCTTCGAC CAGCGCGAGC TGCACGGCGA CCTGCGCGAC
CTGATCGCCA AGCGGTACGC CGACGGCGGG GTGGTGGCGA TCGGGCCGGA CGACACCCTG
CTGACCGCCT ACAACCGCAT GCGCGGCGGC GACATCAGCC AGCTGCCGGT GGTCGATCAC
GGCAAGCTGA TCGGCATTCT CGACGAGAGC GACATCCTGG CCGCCGTCGA GGGCGTCGAA
GACGACGATC GCGGCCCGAA GTTCAAGACC CTGGTCGGGG CGGCGATGAC CAGGGCGGTC
AACACCCTGC AATCGACGCA GGGCGTGGAC GCCCTGCCCG AGGTCTTCGA CCGCGACGAG
GTCGCCCTGG TCTGCGACGG CGACGAATTC GTCGGAGTGA TCACCCGGGT GGACCTGATC
AACCACCTGC GGATGAGCGC GCCTAGTTGT TGCGGAAATG TTCCGCCGGA GTCATTCTAG
 
Protein sequence
MNDPVFALPP VANSALDLIG HTPMMEVRNL DTGPCRLFLK LENQNPGGSI KDRVARSMIE 
AAEADGSLKP GGTIIEATAG NTGLGLAQVA TLKGYKLILI VPDKMAREKI LHLRAMGVDV
RLTRSDVGKG HPEYYQDMAQ TLAQSIPGAI YVNQFENPAN PLAHETTTAP EIFEQMGGDI
DAMVVGVGSG GTLTGVGRFM AKHSPKTEMV LADPVGSILC DYVATGTYGE AGSWIVEGIG
EDFIPVNAEM DFVKHAYSIS DRESVDTARL LLRKEGILAG SSSGTLLAAA LRYCREQTEP
KRVVTLVCDT GSKYLTKMFN DMWLAAHGFD QRELHGDLRD LIAKRYADGG VVAIGPDDTL
LTAYNRMRGG DISQLPVVDH GKLIGILDES DILAAVEGVE DDDRGPKFKT LVGAAMTRAV
NTLQSTQGVD ALPEVFDRDE VALVCDGDEF VGVITRVDLI NHLRMSAPSC CGNVPPESF
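
The protein sequence above corresponds to a codon-by-codon translation of the gene sequence. The following is a minimal sketch of that translation, not the tool the database itself uses. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares for internal codons, and it does not model alternative start codons (this gene starts with ATG, so that does not matter here). The names `CODON_TABLE` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids in the conventional TCAG codon order; '*' marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon; whitespace between blocks is ignored."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First three 10-base blocks of the gene sequence above.
print(translate("ATGAACGACC CCGTTTTCGC CCTTCCGCCG"))  # MNDPVFALPP
```

Translating the last line of the gene sequence the same way gives `NHLRMSAPSCCGNVPPESF*`, where the trailing `*` is the TAG stop codon that is omitted from the protein sequence.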