Gene Caul_5259 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_5259 
Symbol 
ID5897255 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010335 
Strand
Start bp193752 
End bp194741 
Gene Length990 bp 
Protein Length329 aa 
Translation table11 
GC content69% 
IMG OID641555362 
Product2-nitropropane dioxygenase NPD 
Protein accessionYP_001676693 
Protein GI167621908 
COG category[R] General function prediction only 
COG ID[COG2070] Dioxygenases related to 2-nitropropane dioxygenase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000246639 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCGATAC CGGAAGTTCT GCGGCGCGGA TTGCGCCTGC CCGTCATCGC CTCGCCCCTG 
TTCATCATCT CCGGGCCGGA GCTGGTCATC GCCCAGTGCA AGGCTGGGGT GATCGGGTCG
TTCCCGGCGC TGAACGCGCG GCCCGGTCCC GTGCTCGAGG ACTGGCTGAT CCAGATCACG
CAGGCGTTGG CCGACCACGA CGCACGCCAT CCCGACCGTC CCGCCGCGCC GTTCGCGGTC
AACCAAATCG TCCATAAATC CAACAACCGG CTCGAACACG ACCTGGGTCT GTGCGTCAAA
TACAAGGTCC CCCTGGTCAT TACCTCGCTG GGCGCGCGCG CTGATCTGAA CGCGGCGGTG
CACGGCTATG GCGGGCTGGT CATGCATGAT GTCATCGACC AGACCTTCGC CCGCAAGGCC
GTCGACAAGG GGGCCGACGG CCTGATCCTG GTGGCGACAG GGGCAGGCGG CCATGCGGGC
GATCAATCGC CGTTCGCCCT GGTCGAGGAG ACCCGCGCCT GGTTCGACGG CCCCGTGGCG
CTCTCGGGCG CGATCGCGAC CGGTCGGGCG GTGCTGGCGG CAGAGGTTCT GGGCGCGGAC
TTCGCCTATG TCGGCTCGGC CTTCATCGCC ACCGAGGAGG CCCGGGCCGC CGCGGCCTAC
AAGCAGGCGA TCGTCGAGGG CCAGGCCGCG GACATCGTGC TCAGCAACCT TTTCACGGGC
GTGCACGGCA ACTATTTGCG GCCGTCCATC GTGGCCGCCG GTCTTGATCC GGACGCGCTT
CCCACCAGCG ATCCCAGCGC CATGGATTTC GGGTCGGGCG GCAATACAGA CGCCAAGGCC
TGGCGTGACA TCTGGGGCTC GGGCCAGGGG ATCGGCGCGG TGCGCGCCGT CACCCCGACC
GCGACGCTGG TCGCGCGATT GACCGACGAA TACGCGGCGG CCAAGGCGGG GCTGACGCTG
GCCGCGCCGG CTCTGGAACC CGCCCTTTGA
 
Protein sequence
MAIPEVLRRG LRLPVIASPL FIISGPELVI AQCKAGVIGS FPALNARPGP VLEDWLIQIT 
QALADHDARH PDRPAAPFAV NQIVHKSNNR LEHDLGLCVK YKVPLVITSL GARADLNAAV
HGYGGLVMHD VIDQTFARKA VDKGADGLIL VATGAGGHAG DQSPFALVEE TRAWFDGPVA
LSGAIATGRA VLAAEVLGAD FAYVGSAFIA TEEARAAAAY KQAIVEGQAA DIVLSNLFTG
VHGNYLRPSI VAAGLDPDAL PTSDPSAMDF GSGGNTDAKA WRDIWGSGQG IGAVRAVTPT
ATLVARLTDE YAAAKAGLTL AAPALEPAL