Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_3952 |
Symbol | |
ID | 5672313 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 4725004 |
End bp | 4726656 |
Gene Length | 1653 bp |
Protein Length | 550 aa |
Translation table | 11 |
GC content | 77% |
IMG OID | 641242831 |
Product | PfaD family protein |
Protein accession | YP_001508248 |
Protein GI | 158315740 |
COG category | [R] General function prediction only |
COG ID | [COG2070] Dioxygenases related to 2-nitropropane dioxygenase |
TIGRFAM ID | [TIGR02814] PfaD family protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0190583 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0375057 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGTGACCC CGGTCGCGCC CGCCCCCCGC CCCGCGGCGG CTCAGCCGGC CACGGCCCAC CCCGGCGCGG CCGGGGCACT CGCCCGGCGG CCGGCCGACA TCCACCGGGT GCTCGCCGAT CTGGAGCGGC CCTGCTACGT CGTGCAGGAC GCGGCGGGGC TCGGCGTCAC CGGCGACGAG CGGGTCGCCG CGGCGGCCGG GCGGGTCCTG GCCGCGGTCG GCCCGCTGCC GCCGGAGCGC CTCGGCGCCC CGGCGTTCCG CCGCGACCAC CAGGTCGCCC AGGCGTACAT GGCCGGGTCG ATGGCTAACG GCATCGCGTC GGCCGACCTG GTCGTCGCGC TGGCCCGGGC GGGCTTCCTG GCCTCGTTCG GCGCGGCCGG CGTCGTCGCC GCCCGGGTCG ACGACGCGCT GGCCGACATC CGCCGGCGGG CACCGGGCCT GGCGTTCGGC TGCAACCTGA TCCACAGCCC GACCGAGGCG GCGATGGAGC GCGACGTCGT CGACGCCTGC CTGCGCCACC AGGTGCGCTG CGTCGAGGCG TCGGCGTTCC TGGACCTCAC CCCGCAGGTG GTGCGCTACC GGGTCGCCGG GCTGCGCCGC GGCCCCGACG GCCGGGCCGT CGCCGACAAC CGCGTCGTCG CCAAGGTGTC GCGCACCGAG GTCGCCGAGC TGTTCCTGCG GCCCGCGCCG GCCGCGCTGG TCCGTCCGCT GGTGGAGCAG GGGCTGGTCA GCGCCGAGCA GGCGGAGCTG GCGGCGACGG TGGCGATGGC CGACGACGTC ACCGCCGAGG CCGACTCCGG CGGGCACACC GACCGCCGGC CGCTGCCCGT CCTGCTGCCC GAGCTGCTCG CGCTGCGCGA CAGCCTGCGC GGCGAGTCCG GAGGGCGCAC CGTGCGGATC GGGGCGGCCG GCGGGCTCGG CACCCCGCGC GCCGTCGCCG CCGCGTTCAC GCTCGGCGCG GACTACGTGG TGACCGGCTC GGTCAACCAG GCCTCCGTCG AGGCGGCGCA GTCGCCGGCC ACCAAGACGC TGCTGGCCCA GGCGGGCGTC ACCGACTGCG TCCAGGCGCC GTCGGCGGAC ATGTTCGAGA TCGGCGCGGA CGTCCAGGTG CTCGGCCGGG GCACCATGTT CGCGTCGAAG GCGCGCCGGC TCTACGAGCT GTACCGCCGC TTCGACGGGC TCGACGAGAT CCCGGCCGAC GAACGCGCGG GCCTGGAGCA GCGGATCTTC CGCCGCTCGC TGGACGAGGT CTGGGCCGAC ACCGTCTCCT ACTTCAGCAC CCGCGACCCC GAGCAGATCG AACGCGCCCA GGAGAACCCG AAGCGGCGGA TGGCGCTGGT CTTCCGCTGG TATCTCGGGC TGTCGTCGGG CTGGAGCATC GCCGGTGCCC CGGACCGGGT CGCCGACTAC CAGGTCTGGT GCGGCCCGGC GATGGGCGCC TTCAACACCT GGGTGCGCGC CAGCGCGCTC GAGCCGCTGG CGAACCGGCA CGCCGCCGTG ATCGCCGCCG AGCTGATGCG CGGCGCCGCG TTCACCAGCC GGGCCGCCGC GCTGGCCCAG GCCGGCGTCC GGCTGCCGGC GCTCGCCACG ACCTACGTGC CGCGGCCGCA CCTGCCGCGG CCAGATTCCG ATCCGGGAGA GCAGGCACCA TGA
|
Protein sequence | MVTPVAPAPR PAAAQPATAH PGAAGALARR PADIHRVLAD LERPCYVVQD AAGLGVTGDE RVAAAAGRVL AAVGPLPPER LGAPAFRRDH QVAQAYMAGS MANGIASADL VVALARAGFL ASFGAAGVVA ARVDDALADI RRRAPGLAFG CNLIHSPTEA AMERDVVDAC LRHQVRCVEA SAFLDLTPQV VRYRVAGLRR GPDGRAVADN RVVAKVSRTE VAELFLRPAP AALVRPLVEQ GLVSAEQAEL AATVAMADDV TAEADSGGHT DRRPLPVLLP ELLALRDSLR GESGGRTVRI GAAGGLGTPR AVAAAFTLGA DYVVTGSVNQ ASVEAAQSPA TKTLLAQAGV TDCVQAPSAD MFEIGADVQV LGRGTMFASK ARRLYELYRR FDGLDEIPAD ERAGLEQRIF RRSLDEVWAD TVSYFSTRDP EQIERAQENP KRRMALVFRW YLGLSSGWSI AGAPDRVADY QVWCGPAMGA FNTWVRASAL EPLANRHAAV IAAELMRGAA FTSRAAALAQ AGVRLPALAT TYVPRPHLPR PDSDPGEQAP
|
| |