Gene Cpha266_0626 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpha266_0626 
Symbol 
ID4569779 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides DSM 266 
KingdomBacteria 
Replicon accessionNC_008639 
Strand
Start bp703023 
End bp704282 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content49% 
IMG OID639765223 
Product2-nitropropane dioxygenase, NPD 
Protein accessionYP_911105 
Protein GI119356461 
COG category[R] General function prediction only 
COG ID[COG2070] Dioxygenases related to 2-nitropropane dioxygenase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATCGTAG ACAACTTCAG GTTACAATTA GGAAAACAAG AGTACGTTCC TCTCGTCATC 
GGAGGCATGG GAGTCAACAT ATCAACAACC GAACTTGCGC TCGCTGCTGA AAGACTCGGC
GGCATAGGCC ATATTTCGGA TGCCGAGACG GGATATGTCT GTGATCAATT ATTTGGAACA
TCCTTTGTCA GCACAAAAAG AAAACGGTAC ATCGACAACA TCAACAACCC CGACAAGGCG
AAAGTCCTTT TTGACCTTGG AGAAGTAGCC GAAGCCCAAA AAAAATACAT CGAGCATACC
GTTTCGCAAA AAACCGGAAA GGGCGCGATT TTTTTGAACT GCATGGAAAA ACTGACGATG
AACAATGCGC AGGAAACCCT GAAAGTTCGC CTCGCTGCCG CAATGGATGC CGGAATTGAC
GGTCTGACCC TCGCTGCCGG CCTGAATCTG AGAACTCTTG ATCTGATTCA GGACCATCCC
CGCTTCCGCG ATGTTAAAAT CGGGATTATC ATCTCGTCGG TCAGGGCCCT GTCGATCTTT
CTGAAACGGG CAGTTCGTCT TCAGCGGTTG CCCGAATATA TTATCGTTGA AGGACCTCTG
GCTGGCGGGC ATCTGGGATT CAGCCCTGAT GACTGGCATA CTTTCGATTT AAAAACAATT
TTTAATGAAG TGATCCAGTT TCTCAAGCAA GAGAATCTGG CAATTCCCGT TATTCCTGCC
GGTGGCATTT TCACCGGAAC TGATGCCGCC GAGTATCTTG CCGCAGGAGC TTCCGCTGTT
CAGGTTGCAA CCCGTTTTAC CATTTCCAAA GAGGCCGGAC TGCCGGCAAA AGTCAAGCAG
CACTACATCA ATGCCACCGA GGAGGACATT GTCGTCAATA TGGCATCAAC GACCGGCTAC
CCGATGCGCA TGCTCATACA GTCTCCAACT CTCGACTATA CCATGAGACC TAACTGTGAG
GGGCTTGGCT ATCTGCTGGA AAACGGAGGA AAATGCAGTT ATATCGACGC CTATCAGAAA
GCTCTTGAGT CAAGAAAATC CGGAGAAAAA CTGGCAATCG GTGAAAAAAC ATGCCTCTGT
ACCGGAATGG CGAATTACGA CTGCTGGACA TGCGGTCATA TGGCTTACCG CCTCAAGGAG
ACCACGAACC GCCTTCATGA CGGATCATGG CAGCTCCCTG CGGCAGAAGA CATCTTTCTC
GATTACCAGT TCAGCAGAGA TCACCAGATT CGTCTTCCAG AGCCCGAAGA AAACGCATAG
 
Protein sequence
MIVDNFRLQL GKQEYVPLVI GGMGVNISTT ELALAAERLG GIGHISDAET GYVCDQLFGT 
SFVSTKRKRY IDNINNPDKA KVLFDLGEVA EAQKKYIEHT VSQKTGKGAI FLNCMEKLTM
NNAQETLKVR LAAAMDAGID GLTLAAGLNL RTLDLIQDHP RFRDVKIGII ISSVRALSIF
LKRAVRLQRL PEYIIVEGPL AGGHLGFSPD DWHTFDLKTI FNEVIQFLKQ ENLAIPVIPA
GGIFTGTDAA EYLAAGASAV QVATRFTISK EAGLPAKVKQ HYINATEEDI VVNMASTTGY
PMRMLIQSPT LDYTMRPNCE GLGYLLENGG KCSYIDAYQK ALESRKSGEK LAIGEKTCLC
TGMANYDCWT CGHMAYRLKE TTNRLHDGSW QLPAAEDIFL DYQFSRDHQI RLPEPEENA