Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sbal223_2932 |
Symbol | |
ID | 7089000 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella baltica OS223 |
Kingdom | Bacteria |
Replicon accession | NC_011663 |
Strand | + |
Start bp | 3454430 |
End bp | 3456076 |
Gene Length | 1647 bp |
Protein Length | 548 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 643461817 |
Product | PfaD family protein |
Protein accession | YP_002358841 |
Protein GI | 217974090 |
COG category | [R] General function prediction only |
COG ID | [COG2070] Dioxygenases related to 2-nitropropane dioxygenase |
TIGRFAM ID | [TIGR02814] PfaD family protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.266003 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 42 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTAGCC ATACTCTAGA TCAATTCAAT AGTAATAACG AAAAACTTAG CCCTTGGCCG TGGCAAGTCA ACGATGCAGA GCTGAGCTTT GATATCGACT CATTAGGCAA AAAACTCAAA GATTTAAGCC AAGCCTGCTA CTTAGTGAAT CACAGCGAAA AAGGCTTAGG CATAGCGCAA ACGGCCGAAG TGACCACCAG TGACAGCCAA GCGCCATTGG GCTCACACCC TGTCAGCGCC TTTGCCCCTG CCCTTGGCAC CCAAAGTTTA GGCGACAGTA ATTTTCGCCG CGTCCACGGT GTCAAATACG CTTACTACGC CGGCGCTATG GCAAACGGTA TTGCCTCGGA AGAACTGGTT ATCGCCCTTG GCCAAGCGGG TATTTTATGC TCATTTGGCG CGGCGGGGTT AATTCCATCG CGTGTTGAAG CGGCCATTAC CCGCATTCAA GCGGCGCTGC CCAATGGCCC TTACGCTTTT AACTTAATTC ACAGCCCAAG CGAGCCCGCA TTAGAGCGCG GTAGCGTTGA GCTGTTTTTA AAACATAAAG TGCGCACGGT CGAAGCCTCG GCATTTTTAG GTTTAACGCC GCAGATCGTC TATTACCGTG CAGCGGGGTT GAGCCGCGAC GAACATGGTG AGATAGTCAT TGGCAATAAA GTCATAGCAA AAATCAGCCG CACCGAAGTC GCGACTAAGT TTATGGAGCC GGCGCCGGCC AAAATTCTCC AGCAATTAGT GAGTGACGGT CTTATCAGCC AAGACCAAAT GGCGATGGCG CAACTCGTGC CAATGGCGGA CGACATCACG GCCGAAGCCG ACTCAGGCGG CCATACCGAC AATCGTCCAC TGGTCACGCT ATTGCCGACG ATTTTGGCGC TCAAAGATGA AATCCAAGCT AAATATCAAT ATAAAACGCC CATCCGTGTG GGTGCAGGCG GCGGCGTCGG TACTCCCGAT GCAGCACTTG CCACCTTCAA CATGGGCGCG GCCTTTATCG TCACAGGTTC TATCAACCAA GCTTGCGTCG AAGCGGGCGC GAGCGAACAC ACCCGTAAGT TACTCGCCAC CACAGAAATG GCCGATGTGA CTATGGCACC TGCCGCCGAT ATGTTCGAAA TGGGCGTGAA ATTACAAGTG GTTAAGCGCG GCACCCTGTT CCCGATGCGC GCCAATAAGC TTTATGAGAT CTACACCCGT TACGATTCGA TTGATGCTAT CCCTACGGAC GAGCGTAAAA AGCTCGAAGA GCAAGTGTTC CGCTCATCAC TCGATGACAT TTGGGCGAGT ACTGTCGCCC ACTTTAACGA GCGCGATCCA AAGCAAATCG AGCGCGCACT GGATAATCCC AAACGTAAGA TGGCGCTGAT TTTCCGCTGG TATTTGGGTC TTTCGAGCCG CTGGTCAAAC ACAGGTGAAA TCGGCCGCGA AATGGATTAC CAAATCTGGG CAGGCCCAGC ACTGGGTGCA TTTAACGCTT GGGCGAAAGG CAGTTATTTA GATGACTATA AAGCCCGTAA TGCGGTGGAT TTAGCCAAAC ATTTAATGGT GGGCGCGGCG TATCAATCCC GCATTAACTT GCTGTTATCT CAAGGTGCGA GCATTCCGGT CACACTACTA CGCTGGAAAC CGCTGAATCG TTTTTAA
|
Protein sequence | MTSHTLDQFN SNNEKLSPWP WQVNDAELSF DIDSLGKKLK DLSQACYLVN HSEKGLGIAQ TAEVTTSDSQ APLGSHPVSA FAPALGTQSL GDSNFRRVHG VKYAYYAGAM ANGIASEELV IALGQAGILC SFGAAGLIPS RVEAAITRIQ AALPNGPYAF NLIHSPSEPA LERGSVELFL KHKVRTVEAS AFLGLTPQIV YYRAAGLSRD EHGEIVIGNK VIAKISRTEV ATKFMEPAPA KILQQLVSDG LISQDQMAMA QLVPMADDIT AEADSGGHTD NRPLVTLLPT ILALKDEIQA KYQYKTPIRV GAGGGVGTPD AALATFNMGA AFIVTGSINQ ACVEAGASEH TRKLLATTEM ADVTMAPAAD MFEMGVKLQV VKRGTLFPMR ANKLYEIYTR YDSIDAIPTD ERKKLEEQVF RSSLDDIWAS TVAHFNERDP KQIERALDNP KRKMALIFRW YLGLSSRWSN TGEIGREMDY QIWAGPALGA FNAWAKGSYL DDYKARNAVD LAKHLMVGAA YQSRINLLLS QGASIPVTLL RWKPLNRF
|
| |