Gene Clim_1939 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagClim_1939 
Symbol 
ID6354994 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium limicola DSM 245 
KingdomBacteria 
Replicon accessionNC_010803 
Strand
Start bp2150148 
End bp2151407 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content51% 
IMG OID642669537 
Product2-nitropropane dioxygenase NPD 
Protein accessionYP_001943950 
Protein GI189347421 
COG category[R] General function prediction only 
COG ID[COG2070] Dioxygenases related to 2-nitropropane dioxygenase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATCGTAG ACAATTTCAG GTTACAATTA GGCGGAAAAG AGTTTATACC GATCGTTATC 
GGCGGTATGG GAGTAAACAT ATCGACAACT GAACTTGCCC TTGCGGCAGA GAAACTCGGA
GGTGTCGGCC ATATCTCGGA TGCCGAGGTT TGTTACGTCT GCGACCAGAT TTTCAGTACG
TCATACGTTT CGCGGAAAAG AAAACGGTAC GCCGCCTATA CCAATAATCC CGACAAGTCT
GCGGTTCTGT TCGATCTTGA AGAGGTTGCC GAAGCCCAGA AAAGATACAT CGAGCACACC
GTTTCACAGA AAACCGGAAA GGGTGCTGTT TTTCTGAACT GCATGGAAAA ACTGACCATG
AACAATGCGT CAGAAACCCT CAAGGTAAGA CTTTCCGCCG CTCTGGATGC AGGCATCGAC
GGCCTGACCC TTGCTGCAGG CCTCAATCTG CGAACGCTCG ATCTCATTCA GGATCATCCC
AGGTTCCGCG ACGCACAGAT AGGTATTATC ATTTCTTCAG TCCGGGCTCT GGCCATCTTC
CTGAAACGGG CAGTCCGTCT CAACCGGCTT CCGGATTATA TTATCGTCGA AGGGCCTCTG
GCCGGAGGAC ATCTGGGATT CGGTCCGCTC GACTGGCATA CCTTCGACCT GAAAACCATC
GTAACGGAAG TGCTCGACTT CCTGAAAAAA GAAAACCTTG CAATTCCGGT AATTCCGGCA
GGCGGAATCT TTACCGGTAC GGATGCGGCA GATTACCTCA CCATGGGAGC TTCTGCTGTA
CAGGTTGCCA CCCGTTTTGC CATTTCAAGG GAGGCTGGCC TGCCTTCACC GGTAAAACAG
GAATATATCA ATGCCGAAGA GAAAGATATC GTGGTGAACA TGGCATCGAC AACCGGCTAC
CCGATGCGCA TGCTCGTAAA CTCGCCTACA CTGTCCTACA ACATCAAACC GAACTGCGAA
GGGCTTGGCT ATCTTCTGGA AAATGGCGGG AAATGCACCT ATATCGATGC GTATTACAAG
GCGCTCGAAA CGAAACAACC CGGCCAGAAG CTCACTCCTG TCGAAAAAAC ATGCCTCTGC
ACCGGCATGG CCCGTTACGA CTGCTGGACA TGCGGCCACA TGACCTACCG CCTCAAGGAT
ACCACGATCA GGCTTTCGGA TGGTTCATGG CTGCTCCCTT CCGCTGAACA TATTTTTCTT
GACTACCAGT TCAGCAAAGA TCATCAGATC AGATTACCTG AACCGGAGAA AAGCGTATAA
 
Protein sequence
MIVDNFRLQL GGKEFIPIVI GGMGVNISTT ELALAAEKLG GVGHISDAEV CYVCDQIFST 
SYVSRKRKRY AAYTNNPDKS AVLFDLEEVA EAQKRYIEHT VSQKTGKGAV FLNCMEKLTM
NNASETLKVR LSAALDAGID GLTLAAGLNL RTLDLIQDHP RFRDAQIGII ISSVRALAIF
LKRAVRLNRL PDYIIVEGPL AGGHLGFGPL DWHTFDLKTI VTEVLDFLKK ENLAIPVIPA
GGIFTGTDAA DYLTMGASAV QVATRFAISR EAGLPSPVKQ EYINAEEKDI VVNMASTTGY
PMRMLVNSPT LSYNIKPNCE GLGYLLENGG KCTYIDAYYK ALETKQPGQK LTPVEKTCLC
TGMARYDCWT CGHMTYRLKD TTIRLSDGSW LLPSAEHIFL DYQFSKDHQI RLPEPEKSV