Gene Caul_0494 details

Gene Information

Locus tag: Caul_0494
Symbol:
ID: 5897949
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 536249
End bp: 537364
Gene Length: 1116 bp
Protein Length: 371 aa
Translation table: 11
GC content: 67%
IMG OID: 641560977
Product: saccharopine dehydrogenase
Protein accession: YP_001682126
Protein GI: 167644463
COG category: [S] Function unknown
COG ID: [COG3268] Uncharacterized conserved protein
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value: 0.961307
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCCTTC ATGGGCGATG GCATAGGGGC GGCTCGAGCA GGGGAGAGAC CATGACCGCC 
GTCTGGATAC TGGGCGCCAC GGGCCGCACC GGCAGCGTGA TCGCGACGAA CCTGGCCGCC
GCCGGGGTCG GGTTGGTCCT TGTCGGGCGA GATGGCCCCG CTCTGCAACA TTTGGCGGAC
AAGATCGGCG GAAATCCAAG GGTCCTTGCG ACTGCCAGCC TTGAGAAGAT CAAGACCGAA
CTCGACGGAG CCGGCCCGAC CGTGGTCGTC AACCTCATCG GGCCATTCGC TGAAACGGCG
CTACCTTTCA TAAAGGCATG CGCCCCCGGC AGCGGGTATC TTGATCTCTC CAATGACCGC
GCCGCGACAG CAGCAATTCT CGATCTGGAT CAAAAGGCCC GGACAACCGG CCGATGCCTG
GTCAGCGGTG CGGGCTGGGG CGTGCTCGCA GCGGAGAGCA CGGTGCTCAT GCTCTGCAAG
GATCGACCGC CCGCAGCGCG GGTGAGAGTC GATCTGGCGC CTTTCATCAA CGCGTCCGGC
CGGATCGGCG AAACGTTCGC CGCCACCCTG GTCGAGGCGA TGGCCGTCGG CGCGCAAATC
TATGAAGACG GTCGGCTGAC TCGAGCCCGC ATCGGAGACC GGAGCGAAAC CCTGATCGCC
CCCGACGGAT CGAAAATCCG GACGGGCGTG GTTTCCAGTG GCGATCTGGA GGCGGCCCGA
CGTGCGAGCG GCGCGGCCTT CGCGGTGGCC GCCTCGACCC TGGCGCCCAG CTCGGGGGGC
GCACGCGCGG CGATGTCCGC GATTGTGTTC CTGCTCGGCT TCCGCAGCGC CCGAGAAGTC
GCGAAACGGC TCTTGGCCAA TGTCGTCGCG CCGCCCGCCA AGGGGGCGCC CAAATCATCC
TGGGCTCATG CCAGGGTCGA GTGGGCGGAC GGAACGATGC GCGAGACCTG GCTCCGAGCG
GGCGAGGGCA TGGCTTTCAC GTGCAAGGTC GCCACCGAGG TCGCTCTTCG GCTTTCACGC
GGCGAGGGGC GGCCGGGGGC CTTCACGCCA GCCGCGCTAT TTGGGCCGGA ACTGGCCGAG
GCGGCCGGAG CGAAATTTAT CGGTGAGCGA AGGTGA
 
Protein sequence
MALHGRWHRG GSSRGETMTA VWILGATGRT GSVIATNLAA AGVGLVLVGR DGPALQHLAD 
KIGGNPRVLA TASLEKIKTE LDGAGPTVVV NLIGPFAETA LPFIKACAPG SGYLDLSNDR
AATAAILDLD QKARTTGRCL VSGAGWGVLA AESTVLMLCK DRPPAARVRV DLAPFINASG
RIGETFAATL VEAMAVGAQI YEDGRLTRAR IGDRSETLIA PDGSKIRTGV VSSGDLEAAR
RASGAAFAVA ASTLAPSSGG ARAAMSAIVF LLGFRSAREV AKRLLANVVA PPAKGAPKSS
WAHARVEWAD GTMRETWLRA GEGMAFTCKV ATEVALRLSR GEGRPGAFTP AALFGPELAE
AAGAKFIGER R
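
The length fields and the two sequences in this record can be cross-checked against each other. The following is a minimal sketch, not part of the original record: it assumes that Start bp and End bp are 1-based inclusive coordinates, and that the Protein Length excludes the stop codon. The helper names (`gene_length`, `protein_length`, `translate`) are hypothetical and chosen for illustration. The codon table is the standard amino-acid assignment used by translation table 11.

```python
from itertools import product

# Values copied from the Gene Information fields of this record.
START_BP = 536249
END_BP = 537364

# Amino-acid assignments of NCBI translation table 11, codons ordered
# TTT, TTC, TTA, TTG, TCT, ... (bases in T, C, A, G order); '*' marks a stop.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(n_bp: int) -> int:
    """Residues encoded by a CDS of n_bp bases, not counting the stop codon."""
    if n_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return n_bp // 3 - 1


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, ignoring spaces and newlines."""
    dna = "".join(dna.split()).upper()
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3))


GENE_LEN = gene_length(START_BP, END_BP)
PROTEIN_LEN = protein_length(GENE_LEN)
# First 21 bases and last 6 bases of the gene sequence listed above.
FIRST_RESIDUES = translate("ATGGCCCTTC ATGGGCGATG G")
LAST_RESIDUES = translate("AGGTGA")

print(GENE_LEN, PROTEIN_LEN, FIRST_RESIDUES, LAST_RESIDUES)
```

Under these assumptions the computed lengths agree with the Gene Length and Protein Length fields, and the translated ends agree with the start (MALHGRW) and final residue (R) of the listed protein sequence.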