Gene Caul_5309 details

Gene Information

Locus tag: Caul_5309
Symbol:
ID: 5897088
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010333
Strand:
Start bp: 19001
End bp: 20143
Gene Length: 1143 bp
Protein Length: 380 aa
Translation table: 11
GC content: 67%
IMG OID: 641550602
Product: 2-nitropropane dioxygenase NPD
Protein accession: YP_001672088
Protein GI: 167621580
COG category: [R] General function prediction only
COG ID: [COG2070] Dioxygenases related to 2-nitropropane dioxygenase
TIGRFAM ID:
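
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes a single trailing stop codon. The function names `span_length` and `coding_codons` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 19001
END_BP = 20143
GENE_LENGTH_BP = 1143
PROTEIN_LENGTH_AA = 380


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one trailing stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))    # 1143
print(coding_codons(GENE_LENGTH_BP))    # 380
```

Under these assumptions both results agree with the reported gene length (1143 bp) and protein length (380 aa).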


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.1832
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTTTGC ACAGCCGCAT CTGCGAGATT TTCGGTATCC GCTATCCCAT CGTCCTCGCC 
GGAATGGGCG GGGCCAGTGT TCCCCGCCTG GCAGCGGCGG TCTCCAATGC CGGAGGGCTA
GGAATCCTTG GCGCCGCAGC CTGCTCGCCC GAGGAACTTC GCGCGTGGAT CCGCGAAGTG
CGATCGCTCA CCGACAAGCC CTTCGGCGTC GACACATTGC TGCCGGCCTC GGTCCGCCGC
GAGGTGGTCG ACGCGGCGGC AGGCTCCGGC GAGGGCGGCA AGCCGTCGCC CATGGATCTG
CTTGGCGACT ATCAGGCGTT TGCAGCCGAC TTCATGCGCC AGGAAGGCCT GCAAAAGGTC
GTCCGTCCGC GCGAGGACAC CAACGCCGAA GCTCGGGGCG GCCCCGCGTT CTTCTCCAAG
GAGTTCTTCG AGGCGCAAAT GGAGGTGGTG ATCGAGGAAA AGGTGCCGGT CTACGCCGCG
GGCCTGGGAA ACCCCGGCCC CTGGATGGAG CGACTGCGGG AGAACGGTAC AAAGATCATG
GCCGTCATCG GGTCTGTGAA GCACGCGCTG CAGGTCGCCG CGTCTGGCAT CGACGTGGTG
GTCGCTCAGG GACATGACGG GGGCGGACAC AATTCGCCGA TCGGCACCAT GGCGCTTATC
CCCCAGGTCG TCGACGCCAT GGCGGGGCGC ATCCCGGTGC TCGGGGCCGG CGGTATTGCC
GACGGCCGCG GCGTAGCCGC CGCGATGATG CTGGGGGCTG AAGGCGCCTG GGTGGGCACC
GCTTTCCTGG CGACGGAGGA AGCCGGCATT CAGCAATTCC AGAAGGAGGT TCTGGTCGAG
TACGGCGATG GCGACACGGT AGTGTCAAAA TCCGTCACCG GAAAGCCGGC CAGGATCATT
CGCAATAAGT GGGCGCAGGC GTGGGTGGAC GCGGAGAAGT CACCGCTGCC CATGCCCTTC
CAGTCGATCA TCGCCGGGCC CGTGCTCGCG GCGGCGACCC TGGACCAGCG CAAGGATATC
GCGCCTGGGT TTGCCGGCCA GGGCATGGGG CTCATCAAGG CGATTCGCCC CGCCCGGGAC
GTCCTGGAAG ACCTCGTCAG CGGCGCCGAG ACCGCGCTCG CTCGCGCCGA CCGTTTTCGC
TAA
 
Protein sequence
MALHSRICEI FGIRYPIVLA GMGGASVPRL AAAVSNAGGL GILGAAACSP EELRAWIREV 
RSLTDKPFGV DTLLPASVRR EVVDAAAGSG EGGKPSPMDL LGDYQAFAAD FMRQEGLQKV
VRPREDTNAE ARGGPAFFSK EFFEAQMEVV IEEKVPVYAA GLGNPGPWME RLRENGTKIM
AVIGSVKHAL QVAASGIDVV VAQGHDGGGH NSPIGTMALI PQVVDAMAGR IPVLGAGGIA
DGRGVAAAMM LGAEGAWVGT AFLATEEAGI QQFQKEVLVE YGDGDTVVSK SVTGKPARII
RNKWAQAWVD AEKSPLPMPF QSIIAGPVLA AATLDQRKDI APGFAGQGMG LIKAIRPARD
VLEDLVSGAE TALARADRFR
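
The relationship between the gene sequence and the protein sequence can be sketched as follows. This is a minimal illustration rather than the method used to produce the record. It strips the 10-character grouping used above, translates codon by codon, and stops at the first stop codon. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code; the alternative start codons that table 11 permits are not handled, which does not matter here because this gene begins with ATG. The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
# Codon table in the conventional TCAG ordering (third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a grouped sequence listing."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence listed above.
first_line = "ATGGCTTTGC ACAGCCGCAT CTGCGAGATT TTCGGTATCC GCTATCCCAT CGTCCTCGCC"
print(translate(first_line))  # MALHSRICEIFGIRYPIVLA
```

The printed peptide matches the first 20 residues of the protein sequence above. Passing the full gene sequence to `gc_percent` would give a value to compare against the reported GC content.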