Gene Caul_0425 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_0425 
Symbol 
ID5897699 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp465196 
End bp466197 
Gene Length1002 bp 
Protein Length333 aa 
Translation table11 
GC content67% 
IMG OID641560911 
Product2-nitropropane dioxygenase NPD 
Protein accessionYP_001682060 
Protein GI167644397 
COG category[R] General function prediction only 
COG ID[COG2070] Dioxygenases related to 2-nitropropane dioxygenase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones40 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCCCCG CCGCCTTCGC GCCTCGCCTT CGCCTGCCGG TCATCGGCTC GCCGCTATTT 
ATCATCTCGG GCCCCGACCT GGTGATCGCC CAGTGTAAGG CTGGGATCAT CGGCTCGTTT
CCGTCGCTGA ACGCTCGCCC CCTCTCCCTG CTCGACGAGT GGCTGCACCG CATCACCGAG
GAGCTGGCCG CCTGGGACCG GGCGCATCCG GAAAGCCCCT CGGCGCCGTT CGCGGTCAAC
CAGATCGTTC ACAAGACCAA CAACCGCTTG GACGAGGACC TGGCGCTCTG CGTCAAGTGG
AAGGCTCCTC TGGTCATCAC CTCGCTGGGC GCGCGCGCGG ACGTCAATCA AGCCGTCCAC
GACTATGGCG GTCTGACCTT CCACGATGTC ATCAACGATC GCTTCGCCCA CAAGGCCATC
GAGAAAGGCG CCGACGGCCT CATCGCCGTC GCGGCGGGAG CCGGCGGCCA CGCCGGCACC
CTGTCGCCCT TCGCCCTGAT CCAGGAGATC CGAGCCTGGT TCGAGGGCCC CTTGGCGCTG
TCGGGCTCGA TCGCCAACGG CGCCGCGATC CTCGCCGCCC AGGCCCTGGG CGCGGATTTC
GCCTATATGG GCTCGGCCTT CATCGCCACC CAGGAAGCCA ACGCCGATCC CGCCTACAAG
CAGATGATCG TCGAGGCCGC CTCGTCCGAC ATCCTCTATT CCAACCTCTT TACCGGCGTG
CACGGCAACT ATCTTCGCCC GTCGATCATC AAGGCGGGGT TGGACCCCGA CAACCTGCCG
ATCAGCGATC CGTCGGCGAT GAACTTCGGC TCCGGCGGCA ATCAAAAGGC CAAGGCCTGG
CGCGACATCT GGGGCTGCGG CCAGGGGATC GGCGCGATCG ACGCGGTGCG CACGACCGCA
CAGTTCGTCG ATCAGTTGGA AGCCGAATAC GAGGCGGCCA TTCGGGCTTT GGACCAAAGA
ACCGAGGCGG CGGGCTCGGT GCGCGTCTGG GGCGCGGCCT AG
 
Protein sequence
MIPAAFAPRL RLPVIGSPLF IISGPDLVIA QCKAGIIGSF PSLNARPLSL LDEWLHRITE 
ELAAWDRAHP ESPSAPFAVN QIVHKTNNRL DEDLALCVKW KAPLVITSLG ARADVNQAVH
DYGGLTFHDV INDRFAHKAI EKGADGLIAV AAGAGGHAGT LSPFALIQEI RAWFEGPLAL
SGSIANGAAI LAAQALGADF AYMGSAFIAT QEANADPAYK QMIVEAASSD ILYSNLFTGV
HGNYLRPSII KAGLDPDNLP ISDPSAMNFG SGGNQKAKAW RDIWGCGQGI GAIDAVRTTA
QFVDQLEAEY EAAIRALDQR TEAAGSVRVW GAA