Gene Caul_4440 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_4440 
Symbol 
ID5901901 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp4806150 
End bp4807355 
Gene Length1206 bp 
Protein Length401 aa 
Translation table11 
GC content63% 
IMG OID641564958 
Productsaccharopine dehydrogenase 
Protein accessionYP_001686058 
Protein GI167648395 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1748] Saccharopine dehydrogenase and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCAAGG TGCTGGTGAT CGGCGCTGGC GGCGTCGGTT CGGTCGCGGT CCATAAGATG 
GCGATGAACA CGGACGTGTT TTCGCACATC ACTTTGGCCA GCCGCACGAA GTCGAAGTGC
GACGCGATCG CGCAATCCGT GAAGCAGCGA ACCGGCGTGA CCATCGACAC GGCCGCGCTC
GACGCCGACG ACGTCGCCGC GACCACGGCG CTGATCCAGG CGGTCAAGCC AGAGCTGGTG
GTCAATCTGG CGCTGCCCTA TCAGGATCTG AACATCATGG ACGCCTGTCT GGCGACCGGG
GTGAACTATC TCGATACGGC CAACTACGAG CCGCGCGACG AGGCCAAGTT CGAATATAGC
TGGCAGTGGG CCTATCAGGA CCGCTTCAAG GAGGCCGGCC TGATGGCCCT GCTGGGCAGC
GGCTTCGACC CCGGCGTGAC CTCGGTGTTC ACCACCTACA CCAAGAAGCA CCTGCTGGAC
CGGATCGACA CGCTCGACAT CCTGGACTGC AACGGCGGCG ATACCGGCCT GCCCTTCGCC
ACCAACTTCA ATCCCGAGAT CAACCTGCGC GAAGTGACCG CGCCCTCGCG GCACTGGGAA
AACGGCCAGT GGATCGAGGG GCCGGCGCTG AGCCACAAGC AGGTGTTCGA CTTCGACCAG
GTTGGGCCGA AGAACATGTA CCTCATGTAT CATGAGGAGC TGGAATCCCT GGCCAAGTTC
TATCCGGAGA TCCAGCGCAT CCGCTTCTGG ATGACGTTCG GCGACTCCTA TCTCAAGCAC
CTGGAGGTGC TGGAGAACAT CGGCATGACC CGCATCGAGC CGATGATGTT CCAGGGGCGC
GAGATCATCC CCATCGAGTT CCTCAAGGCC CTGCTGCCCG AGCCGTCGTC GCTGGGTCCG
ATCACCAAGG GCAAGACCAA TATCGGCACG ATCGCTACGG GCCAGAAGGA CGGCCAGGCC
CGGACGGTCT ACGTCAACAA CGTGTGCGAC CACGAGGCCG CCTATGCCGA GACCGGCAAC
CAGGCCGTCA GCTACACGAC AGGCGTCCCG GCCATGATCG GCGCGGCCCT GATGATGACC
GGCCAATGGA AGGGCGCGGG CGTGTTCAAC ATGGAGCAGC TGGACCCCGA TCCGTTCATG
GACATGCTGA ACAAGCACGG CCTGCCCTGG CAGGTTCGCG ACCTCGACGC CCCGCTGGAC
TTCTGA
 
Protein sequence
MGKVLVIGAG GVGSVAVHKM AMNTDVFSHI TLASRTKSKC DAIAQSVKQR TGVTIDTAAL 
DADDVAATTA LIQAVKPELV VNLALPYQDL NIMDACLATG VNYLDTANYE PRDEAKFEYS
WQWAYQDRFK EAGLMALLGS GFDPGVTSVF TTYTKKHLLD RIDTLDILDC NGGDTGLPFA
TNFNPEINLR EVTAPSRHWE NGQWIEGPAL SHKQVFDFDQ VGPKNMYLMY HEELESLAKF
YPEIQRIRFW MTFGDSYLKH LEVLENIGMT RIEPMMFQGR EIIPIEFLKA LLPEPSSLGP
ITKGKTNIGT IATGQKDGQA RTVYVNNVCD HEAAYAETGN QAVSYTTGVP AMIGAALMMT
GQWKGAGVFN MEQLDPDPFM DMLNKHGLPW QVRDLDAPLD F