Gene Caul_2176 details

Gene Information

Locus tag: Caul_2176
Symbol:
ID: 5899631
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 2361831
End bp: 2363510
Gene Length: 1680 bp
Protein Length: 559 aa
Translation table: 11
GC content: 69%
IMG OID: 641562667
Product: cyclohexanone monooxygenase
Protein accession: YP_001683802
Protein GI: 167646139
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG2072] Predicted flavoprotein involved in K+ transport
TIGRFAM ID:
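
The coordinate and length fields in this record are related by simple arithmetic. The sketch below checks that relationship under two assumptions that the record itself does not state: that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single stop codon. The helper names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS that ends in one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Caul_2176 record.
length = gene_length_bp(2361831, 2363510)
print(length)                     # 1680
print(protein_length_aa(length))  # 559
```

Both results agree with the Gene Length and Protein Length fields of the record.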


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value: 0.665097
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.481934
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGACCG GGGAGGAAAT CATGGCGTCA GCAATCCCCC ATCCGGGCGC GAGCTCGGCT 
GTTCGCGAAC CGCTTCGCAT CCCCGCCCCA GACACCAAGA ACGGGGCGCC GCTCGACGCC
GTCGTGGTCG GGGCCGGCTT CGGCGGGCTC TACATGGTCC ACCGGCTGCG CGAGGCGGGG
CTGTCGGTCC AGGGGATCGA GGCGGCCGGC GACGTCGGCG GCACCTGGTA CTGGAACCGC
TATCCCGGCG CCCGCTGCGA CATTCCCAGC CTGCTCTATT CCTACACCTT CTCGGACGAC
CTGCAGGCCG ACTGGCGGTG GAGCGAGAAG TACGCCACCC AGCCCGAGAT CCTGGCCTAC
GCCCAGCATG TGGCCGACCG CTTCGCCCTG CGCGACGCCT TCCTGTTCGA GACCCGCGTG
GTCTCGGCGG TCTTCGACGA GGCGGCCAGC CTCTGGCGGG TGAAGACCGA CCGAGGCGAC
GAGATCGCCG CCCGCTTCTG CGTGATGGCC ACCGGCTGCC TGTCGGTCCC GCGCGACTTC
GACCTGCCGG GCGCCGAGAC GTTCGAGGGC GAGACCTACG TCACCGGCCT GTGGCCGCAC
GAGCCGATCG ACTTCACCGG CAAGCGGGTG GCGGTGATCG GCACGGGCTC GTCGGGGATC
CAGTCGACGC CGCTGATCGC CGAGCGGGCC GAGCACGTTT ATGTCTTCCA GCGCACGCCC
AACTACAGCC TGCCGGCCCT GAACGCTCCC CTGACCGAGG AAGAGGTGGC CGCCTTCCGC
GCCAACTTCC CGGCCTATGT CGAGCTGCTG CGGGCCGGCC AGCCGCCCCT GCCCGTCCCG
CCGGCCGACT ATGTTCCCAG CGACGAGGAA CTGAACGCCC TGGTCGCCCA GCTGTGGAAC
GGCTGGGGGC TGATCTCGGC GGTGCAGATC CCCAACATCG TGCGCGACGA GCGCTGCAAC
CAGGCGGCGG GCGATTTCGT CCGCGCCAGG ATCGCGCAAA CCGTCAAGGA CCCGGCCCTG
GCCGAGAAAC TGACCCCGCG CGACTTCCCG ATCATGACCA AGCGGGCCTG CGTCGACACC
GGCTGGTACG AGGCCTTCAA CCGCGACAAC GTCACCCTGG TCGACCTGCG CGAGACGCCG
ATCGAGGCGA TCACCCCGGC CGGCGTGCGC ACCACCGAAC GCGAATACCC CGTCGACGTG
ATCGTCAGCG CCATGGGTTT CGACGCCATG ACCGGCGCCC TGCTGCGCAT GGACATCCGC
GGCCGCGACG GCCTGAGCCT GGCCGAGGCC TGGGCCGAGG GACCCAAGAC CTATCTGGGC
CTGGCGGTGG CGGGATTCCC CAACCTGTTC ACGGTCACGG GTCCAGGCAG CCCCTCGGTG
CTGACCAATG TGCTGACCGC CTGCGAGCAG CATGTCGACT GGATCATGGA CGCCATCACC
CACGTGCGCG CCACCAACGC CGCGACGCTG GAGGCGACCG TCCAGGCCCA GGAAGCCTGG
GTCGCCCACG TCAACGACGA GGCCGACAAG ACGCTCTTCC CCCGCGCCGC CTCCTGGTAC
ATGGGCGCCA ACATCCCGGG CAAACCCCGG GTGTTCATGC CCTATGTCGG CGAGGGCTAT
AAGATCCGCT GCGACACGGT TGCGGCGGAG GGGTATGCGG GGTTCGCCCT GGGGTCTTGA
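
The GC content field of the record can be recomputed from the gene sequence. The sketch below is one way to do that, assuming the sequence is pasted as plain text in the space-separated blocks shown above. The function name `gc_content` is hypothetical, and how the database rounds its reported percentage is not known.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters in seq.

    Spaces, newlines, and any other characters are ignored, so the
    blocked layout used in this record can be pasted in directly.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotides found")
    gc = sum(b in "GC" for b in bases)
    return 100.0 * gc / len(bases)


# First row of the gene sequence only; paste all rows for the full value.
first_row = "ATGAAGACCG GGGAGGAAAT CATGGCGTCA GCAATCCCCC ATCCGGGCGC GAGCTCGGCT"
print(round(gc_content(first_row), 1))
```

Running it on the complete gene sequence gives a value that can be compared with the 69% reported in the Gene Information section.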
 
Protein sequence
MKTGEEIMAS AIPHPGASSA VREPLRIPAP DTKNGAPLDA VVVGAGFGGL YMVHRLREAG 
LSVQGIEAAG DVGGTWYWNR YPGARCDIPS LLYSYTFSDD LQADWRWSEK YATQPEILAY
AQHVADRFAL RDAFLFETRV VSAVFDEAAS LWRVKTDRGD EIAARFCVMA TGCLSVPRDF
DLPGAETFEG ETYVTGLWPH EPIDFTGKRV AVIGTGSSGI QSTPLIAERA EHVYVFQRTP
NYSLPALNAP LTEEEVAAFR ANFPAYVELL RAGQPPLPVP PADYVPSDEE LNALVAQLWN
GWGLISAVQI PNIVRDERCN QAAGDFVRAR IAQTVKDPAL AEKLTPRDFP IMTKRACVDT
GWYEAFNRDN VTLVDLRETP IEAITPAGVR TTEREYPVDV IVSAMGFDAM TGALLRMDIR
GRDGLSLAEA WAEGPKTYLG LAVAGFPNLF TVTGPGSPSV LTNVLTACEQ HVDWIMDAIT
HVRATNAATL EATVQAQEAW VAHVNDEADK TLFPRAASWY MGANIPGKPR VFMPYVGEGY
KIRCDTVAAE GYAGFALGS