Gene Caul_5439 details

Gene Information

Locus tag: Caul_5439
Symbol:
ID: 5897233
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010333
Strand:
Start bp: 152721
End bp: 153932
Gene Length: 1212 bp
Protein Length: 403 aa
Translation table: 11
GC content: 59%
IMG OID: 641550726
Product: alkane 1-monooxygenase
Protein accession: YP_001672212
Protein GI: 167621704
COG category:
COG ID:
TIGRFAM ID:
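
The coordinate and length fields above agree with one another, which can be checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the function names are illustrative and not part of the source record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


print(gene_length(152721, 153932))  # 1212
print(protein_length(1212))         # 403
```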


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.186555
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCAGCA TCGCACGAAA TCCGTCCGCC AATACCGCGC CAGATCGCTA TGTGGACAGG 
AAACGCTATC TATGGATGTT GTCGGTCGTC TGGCCGGCAG CGCCGTTGAT CGGCCTCTAC
CTCGTGAGCA TGACGGGTCT GGGCGTGTTT TACGCCTTTA CGCTAGTCGT TTGGTATGTT
GCGATTCCCG CTTTGGACAT TTTGTTTGGG AATGATCCGA ACAATCCTCC TGAGGCAGCT
GTCGCGCGCC TTGAGGCGGA TCGATATTAT CGAGTTCTCA CCTATCTTAC CGTCCCCGTG
CATTATGCCT CGCTCATCGT CTCGGCGTGG TGGGTAGCGA CCCAGCCGAT GGCCTGGTGG
GAAGTTGTCG CGCTTGCCTT GTCCCTGGGC ATCGTCAACG GCTTGGCGCT GAACACCGGC
CATGAGTTGG GACACAAGAA GGAAGCCTTC GACCGCTGGA TGGCCAAGAT CGTTCTGGCC
GTGGTCGGCT ATGGCCACTT CTTCATCGAG CACAACAAAG GCCACCATCG CGACGTTGCG
ACCCCCGAAG ATCCCGCTAC GTCCAAGATG GGGGAGAGTA TCTATAAATT TTCGCTGCGC
GAGATTCCGG GTGCCTTCAA GCGGGCTTGG AGTCTGGAGA GGGTGCGGCT GGAGCGCCTA
GGCAAGGGGG TCTGGCGCCT GGACAACGAA ATCATCCCGC CGCTGCTGAT CACCGTAGTT
CTCTACACAT CCCTTTTGCT GGCGTTCGGC CCCAACCCCA AGTTGTTAGT GTTCTTGCCC
ATCCAGATCG CCTTCGGATG GTGGCAGCTG ACCAGCGCCA ATTATATCGA GCACTATGGG
TTGCTTCGCG AGAAAATGGC GGACGGGCGT TATGAGCGCG CCCAGCCCCG GCATTCCTGG
AACAGCAATC ACATCGCCTC GAATCTGATC CTGTTCCATC TTCAAAGGCA TTCCGATCAC
CATGCCCACC CGACCCGCAG CTATCAGTCG CTCCGTGACT TTAAAGACCT GCCGGAGTTG
CCGAGCGGTT ACCCCGGCAT GTTCTTCATG GCGATGATTC CGCCCGTGTT CCGGTCGGTG
ATGGACCGCC GGGTCGTGGA ATGGGCGGGC GGCGATCTTG GCAAGATTCA GATCGACGGT
GCGCGCAGGA AGCAGATCGA ACGGAAGTTC GGTGCGGCTT CGCGCCAGCA GGCGCGGGCG
GCGGCCGAGT AG
 
Protein sequence
MSSIARNPSA NTAPDRYVDR KRYLWMLSVV WPAAPLIGLY LVSMTGLGVF YAFTLVVWYV 
AIPALDILFG NDPNNPPEAA VARLEADRYY RVLTYLTVPV HYASLIVSAW WVATQPMAWW
EVVALALSLG IVNGLALNTG HELGHKKEAF DRWMAKIVLA VVGYGHFFIE HNKGHHRDVA
TPEDPATSKM GESIYKFSLR EIPGAFKRAW SLERVRLERL GKGVWRLDNE IIPPLLITVV
LYTSLLLAFG PNPKLLVFLP IQIAFGWWQL TSANYIEHYG LLREKMADGR YERAQPRHSW
NSNHIASNLI LFHLQRHSDH HAHPTRSYQS LRDFKDLPEL PSGYPGMFFM AMIPPVFRSV
MDRRVVEWAG GDLGKIQIDG ARRKQIERKF GAASRQQARA AAE
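
The sequences above are printed in blocks of ten characters, so the whitespace has to be removed before any analysis. The following is a minimal sketch in plain Python for computing GC content and translating the gene sequence. It assumes that the codon-to-amino-acid assignments of translation table 11 match the standard genetic code (the two differ only in permitted start codons); the helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order (assumed to apply to table 11).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    s = clean(dna)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the listing above.
first_bases = "ATGAGCAGCA TCGCACGAAA TCCGTCCGCC"
print(translate(first_bases))  # MSSIARNPSA
```

Passing the full gene sequence to `translate` and comparing the result with the listed protein sequence (after applying `clean` to both) is one way to check that the two listings agree.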