Gene Csal_1015 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCsal_1015 
Symbol 
ID4027861 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChromohalobacter salexigens DSM 3043 
KingdomBacteria 
Replicon accessionNC_007963 
Strand
Start bp1145513 
End bp1146616 
Gene Length1104 bp 
Protein Length367 aa 
Translation table11 
GC content72% 
IMG OID637966192 
Product2-nitropropane dioxygenase, NPD 
Protein accessionYP_573071 
Protein GI92113143 
COG category[R] General function prediction only 
COG ID[COG2070] Dioxygenases related to 2-nitropropane dioxygenase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTGGCCGC AACGTGAACT GCTCGACCGC CTCGCCATCG AACTGCCCAT CGTCCAGGCC 
CCCATGGCCG GCGCCAACGA CGCGACACTG GCCATCGCCG CCAGTCGAGG GGGCGCGCTG
GGCTCGATTC CTTGCGCCAT GCTCGCCCCC GAACGCATCG AGCGGGAGGT CACACGGTTT
CGCGAGCATG CCACCGGCCC GCTCAACCTC AACTTCTTCT GCCACTTGCC GTCACCGCCC
GACCCCAACG CCGAAGCCGC CTGGCGCGAA CGCCTGGCAC CGTTCTACCG CGAGGCGGGG
CTCGACCCCG AGGATGCCGC GCCGGCCGCC CAACGCACGC CCTTCGACGA CGTCCAGTGC
GTGCTGGTCG AGCGTCTGCG TCCCGAGGTC GTGAGCTTTC ATTTCGGCTT GCCGGACGCG
CCCTTGCTGG CTCGCGTGAA AGCCACCGGC GCCACGGTCA TGGCCAGTGC CACCACCGTC
GCCGAGGGGC GCTGGCTGGC CACCAACGGC GCGGACATCA TCATCTCCCA GGGGCTCGAA
GCCGGCGGGC ACCGCGGCGC GTTTCTCGAG GATACCCGCG CGGACACGGT GGCCGACGCC
ATGGCCCGCC AGCCCGGCAC CTTCGCGCTG GTACCGCAGC TCGTCGATGC CATCGACCGG
CCCGTCATCG CCGCCGGGGG CATCGGCGAC GCACGCGGCG TCGCCGCCGC CTTCGCGCTG
GGTGCCTGCG GCGTGCAGCT CGGCACCTAC TACCTGGCCA CGCCGGAAAG TCTGATCAGC
GACATTCATC GCGCCGCCCT GGCCGAGGCC CGCGACGACA ACAGCGTCGT CACCCGCCTG
TTCTCCGGTC GCCCGGCGCG CAGCCTCGTC AATCGAGTGA TTCGCGCACT TGGCCCTCTC
TCGCCAGCCG CTCCGCCCTT TCCCACCGCC GGTGGCGCGC TTGCCCCGCT CAAGCAAGCC
GCCGAGGCCC AAGGGCGTGG CGACTTCTCA TCGCTGTGGG CCGGCCAGGC AGCGGCACTG
GCCCCCCACG GCGACGCCGA GACCCTCACG CGCCGACTGG GCGACGAGAC GCTGGCACGA
CTCCAGGCGC TGGCTTCGCG TTGA
 
Protein sequence
MWPQRELLDR LAIELPIVQA PMAGANDATL AIAASRGGAL GSIPCAMLAP ERIEREVTRF 
REHATGPLNL NFFCHLPSPP DPNAEAAWRE RLAPFYREAG LDPEDAAPAA QRTPFDDVQC
VLVERLRPEV VSFHFGLPDA PLLARVKATG ATVMASATTV AEGRWLATNG ADIIISQGLE
AGGHRGAFLE DTRADTVADA MARQPGTFAL VPQLVDAIDR PVIAAGGIGD ARGVAAAFAL
GACGVQLGTY YLATPESLIS DIHRAALAEA RDDNSVVTRL FSGRPARSLV NRVIRALGPL
SPAAPPFPTA GGALAPLKQA AEAQGRGDFS SLWAGQAAAL APHGDAETLT RRLGDETLAR
LQALASR