Gene Information |
Locus tag | Francci3_2929 |
Symbol | |
ID | 3903993 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | - |
Start bp | 3450326 |
End bp | 3451987 |
Gene Length | 1662 bp |
Protein Length | 553 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 637880250 |
Product | 2-nitropropane dioxygenase, NPD |
Protein accession | YP_482016 |
Protein GI | 86741616 |
COG category | [R] General function prediction only |
COG ID | [COG2070] Dioxygenases related to 2-nitropropane dioxygenase |
TIGRFAM ID | [TIGR02814] PfaD family protein |
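The coordinate and length fields above are related by simple arithmetic. The sketch below checks that relationship, assuming the start and end positions are 1-based and inclusive (an assumption, though it agrees with the reported 1662 bp) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp, has_stop_codon=True):
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


length_bp = gene_length(3450326, 3451987)
print(length_bp, protein_length(length_bp))
```

With the values from this record, the two results match the Gene Length and Protein Length fields.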
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0103438 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.46056 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGTGACGA CGACAACGAC GGTCCGTGGC GCGCCGGCGA CGCCGCGCCC TGGATCGGGC GGGCCGCGGC TCGCGCGATC CGCCCAGGAG ATCCACGACC TGCTCGCCCG GCTGGACGCG CCCTGCGTCG TGGTGAGCGA ACAGGGCGGC GCGATCGCGG CCACCGACGA CCCGGCGAGC CTGCGCGCGG CGGGGGCGAC GGTGCTGGCT GTGGCGCCGC CGGCCCGCCC CGAGCGGCTG GGAGCCGCGT CCTTCCTGAC CGACTACGGG GTGCGTCAGC CCTACATGAC CGGCGCGATG GCGAACGGCA TCGCTTCGCC GGAGCTGGTC GTGGCGATGG CACGGGCGGG CTTCCTCGCC ACGTACGGGG CGGCCGGCGT GCTGCCCGAC CGGATCGACG ACGCGCTTGG GCGGATCCGC CGTGAGCTCG GCCCGGCGCC CTTCGCCTGC AATCTGATCC ACAGCCCGAA CGAGCTGGAG CTGGAACGGG CCATCCTGGC CGCCTGCCTG CGCCACGGGG TGACCTGCGT GGAGGCGTCC GCGTTCCTGG AGCTGACCCC GCAGATCGTC GCCTACCGGG CCGCGGGGCT GCGGCCGGGC GGCGCTGGCG GCGTACACGT CGGGCACCGG GTGGTGGCCA AGGTCTCCCG CGGGGAGGTG GCCGAGCTCT TCCTGCGCCC CGCCCCGGAC GCGCTGCTGC GCCCGCTGGT GGCGGATGGC ACCCTGACCG CCGAGCAGGC CGCGCTCGCC CGCACGGTGC CGATGGCCGA CGACATCACC GTCGAGGCGG ACTCCGGCGG CCACACCGAC CGCCGGCCGC TCCCGGTCCT GCTCCCCGAG ATCATCGCGG TGCGCGACCG GATCGCCGCC GAGCTCGGCT ACCGCCGCCC GCCGAGAGTG GGAGCCGCGG GTGGCATCGG TACGCCATCG GCGGTGTTCG CCGCGTTCGC GCTCGGCGCA GCCTACGTCG TCACCGGTTC GGTGAACCAG GCGTGCGTCG AGTCCGGTCA GTCGGCGGCG GCGCGGGCGC TGCTGGCGAA GGCGGGCCCG AACGACATCG ACATGGCGCC GGCCTCCGAC ATGTTCGAGA TCGGCGCCGA GGTGCAGGTC CTGCGCCGCG GCACGATGTT CGCCGGGCGG GGCCGCCGGC TGTACGACCT CTACCGCGCC CACGACTCCC TCGACGACCT TTCGGCGGAG GATCGGAACT GGCTGGAGCG TTCGGTCCTG CGCCGGTCCG TGGACGAGGT ATGGGCCGAC ACCGTCGACT ACTTCAGCCG GCGCGACCCG GAGCAGATCG AACGCGCGCA GGCCAACCCG AAGAGACGGA TGGCGCTGGT GTTCCGCTGG TATCTCGGGC TGTCCTCGGG CTGGGCGATC TCCGCCGCGC CCGACCGGAT CACCGACTAC CAGATCTGGT GCGGCCCGTC CCTGGGCGCC TTCAACACCT GGGCGGCCGG CAGCTACCTG GCGGACGTCG ACCGGCGCAG CGCGGTGGAC GTCGCGGGTG AGCTGATGCT CGGCGCCGCC TACACCGGGC GGGCCGCGGC GCTGCGGTTC GCCGGGGTGC GGCTGCCGGC GCGGGCGGCC GCCTACCGAC CGCCGGCCAC GCGCGAGAGC TCGCCGGCGC ACCGGTACGT CCTGACGGCG GGTGCCCGGT GA
Protein sequence | MVTTTTTVRG APATPRPGSG GPRLARSAQE IHDLLARLDA PCVVVSEQGG AIAATDDPAS LRAAGATVLA VAPPARPERL GAASFLTDYG VRQPYMTGAM ANGIASPELV VAMARAGFLA TYGAAGVLPD RIDDALGRIR RELGPAPFAC NLIHSPNELE LERAILAACL RHGVTCVEAS AFLELTPQIV AYRAAGLRPG GAGGVHVGHR VVAKVSRGEV AELFLRPAPD ALLRPLVADG TLTAEQAALA RTVPMADDIT VEADSGGHTD RRPLPVLLPE IIAVRDRIAA ELGYRRPPRV GAAGGIGTPS AVFAAFALGA AYVVTGSVNQ ACVESGQSAA ARALLAKAGP NDIDMAPASD MFEIGAEVQV LRRGTMFAGR GRRLYDLYRA HDSLDDLSAE DRNWLERSVL RRSVDEVWAD TVDYFSRRDP EQIERAQANP KRRMALVFRW YLGLSSGWAI SAAPDRITDY QIWCGPSLGA FNTWAAGSYL ADVDRRSAVD VAGELMLGAA YTGRAAALRF AGVRLPARAA AYRPPATRES SPAHRYVLTA GAR
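The sequences above are printed in space-separated blocks of ten. A minimal sketch of how such a field could be cleaned, translated, and checked for GC content follows. It builds the codon table from the standard compact encoding of the genetic code; translation table 11 assigns the same amino acids as the standard table and differs only in permitted start codons, which this sketch does not handle. All function names are hypothetical, and only the first three blocks of the gene sequence are used for the demonstration.

```python
# Compact encoding of the genetic code, with bases ordered T, C, A, G.
# Table 11 uses the same amino acid assignments as the standard table.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text):
    """Remove the block spacing and normalise to upper case."""
    return "".join(text.split()).upper()


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna):
    """Translate in frame from the first base, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence field.
prefix = clean_sequence("ATGGTGACGA CGACAACGAC GGTCCGTGGC")
print(translate(prefix))
print(gc_fraction(prefix))
```

Applied to the full gene sequence field, the same functions would be expected to reproduce the protein sequence field and a GC fraction close to the reported 75%, although that has not been verified here.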