Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_0720 |
Symbol | |
ID | 5669136 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 837098 |
End bp | 838027 |
Gene Length | 930 bp |
Protein Length | 309 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641239647 |
Product | taurine dioxygenase |
Protein accession | YP_001505084 |
Protein GI | 158312576 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG2175] Probable taurine catabolism dioxygenase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.047481 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACGATGA CCAGCCAGGT CCAGACACTG ATCGACGTCC GGCCGTTGTC CGGATACACC GGCGCCGAGA TCCACGGCGT CGACCTGCGG GAGGAACTCG ACGACGCGAC CATCGCCGAG ATCCGGTCCG CGCTGCTGAC CTGGAAGGTG GTGTTCTTCC GCGACCAGAA CCTGGACCAC GCCCAGCAGG TGGCGTTCGG GCGGCGCTTC GGCAAGCTCA CCCCCGCCCA CCCGCACGAG ACCGCGCCCC CGGAGGGGTT CCCCGAGATC CTGCCGATCG ACAGCCGGCG CTACTCCGAA ATCATCGGCA AGCGGAAGGT CACCTACGAC AACGGGTGGC ACACCGACGT GACGGCGCTG GTCAACCCGC CGGCCGGGTC GATTCTGCGC GCCGACATCG TCCCGCCCTA CGGCGGCGAC ACCGCCTGGA CGAACCTGGT CGCCGCCTAC CAGGCCCTGC CCGAGCCGCT GCGGACGCTG GCCGACAGCC TGCGCGCGCG GCACAGCTTC AACCTGCAGA TCTTCGACGG CGGCGAGTAC GGGAAGCGGA TCGCCTCGAA CCCGCTGGTC GCCATCCACC CGGTGGTCCG GGTGCACCCG GAGACCGGCG AGCGGGCCCT GTTCGTCAGC CCCAGCTTCA CCGCGCGTGA CAACGAGATC ATCGGCCTGT CCGCCCGGCA GAGCCACCGT GTCCTCGAGT TGTTCTACGA GCAGATCGCC CGGCCGGAGT TCACGGTGCG GTTCAAGTGG AACCCGGGTG ACATCGCCTT CTGGGACAAC CGCGCCACCG CGCATCTCGG GCCGTCCGAC CTCGGTCATC TCGACTTCGA CCGGGTTCTC TACCGGGTGA CCCTCGAGGG CGACGTCCCG GTCGGTGCCG ACGGGGCGGA GTCGGAGCTG GTCGCCGGCC AGCCGTTCCT CGGCTCCTGA
|
Protein sequence | MTMTSQVQTL IDVRPLSGYT GAEIHGVDLR EELDDATIAE IRSALLTWKV VFFRDQNLDH AQQVAFGRRF GKLTPAHPHE TAPPEGFPEI LPIDSRRYSE IIGKRKVTYD NGWHTDVTAL VNPPAGSILR ADIVPPYGGD TAWTNLVAAY QALPEPLRTL ADSLRARHSF NLQIFDGGEY GKRIASNPLV AIHPVVRVHP ETGERALFVS PSFTARDNEI IGLSARQSHR VLELFYEQIA RPEFTVRFKW NPGDIAFWDN RATAHLGPSD LGHLDFDRVL YRVTLEGDVP VGADGAESEL VAGQPFLGS
|
| |