Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_0710 |
Symbol | |
ID | 3903500 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | - |
Start bp | 812707 |
End bp | 813612 |
Gene Length | 906 bp |
Protein Length | 301 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 637878043 |
Product | hypothetical protein |
Protein accession | YP_479823 |
Protein GI | 86739423 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II |
TIGRFAM ID | [TIGR03089] conserved hypothetical protein TIGR03089 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.668983 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGCCT CCGTGCCGCG GCTGGTCGCC GCACGGACGC TGCTTGGCCC GGCCGCGGAC GCCGACGCCG CACCCGCGGC GGGGGTTCGC GGCATCGCGT CCGTCCTCGC GCACCGGCTG GCCACCGATC CCGCCCGCCC GCTGGTCACC TTCTACGACG ACGCGACCGG CGAACGCGTG GAATTCTCGG CGACCACCTT CGACAACTGG GTGGCGAAGA CCGCGAACAT GCTGGTCGAC ACGCTCGGCC TCGGCATCGG TGACCGGGTC GGCGTCCACC TGCCGACGCA CTGGCTCAGT TCGGTCATCC TGCTCGCCAC CTGGTCGGCC GGGATGGACG CGGTCCTCGT CCGGGACGCA GCCGGCGAGC AGGACGGCGG GGACGCCGAG AAGGCCGGCG GGGAGCTCCC GGTCGCGACC CCCCTCGACG CGCTGTTCGT GGCCGAGGAC CGGCTCGACG AGGCATTCGG CCTGATGGTG GACGAGATCG TGGCCCTGTC ACTGCGCCCG CTGGGTGGTC GGATGCGCAG GCCCGTCGCC GGGGTTCTCG ACTACGCCGC GGAGGTGCCC CCGCACGGCG ATCGGTTCGC CGCCCCGGCA GCGCCGCCTG GGCAGGCCGC GCTGCTGCGC GCGGGTACCG CGATCGCGGG GGCGTGGGGC CTGGGACCCG CGGACCGGGT GCTGTTCACC GCGCCGCTCG CCACGACCGA GGGCCTGGTC GGTTCGCTGC TTGCTCCCCT GGTCGCAGGC TCGTCGATCG TGCTGTGCCG GCATCTCGAC AGCGCCGCGC TCCCGCGGCG CATCGAGACG GAGAAAATCA CCGCCGTCGG CCGGTCAGTC CTGCGGCACG CGCCCAGCCC ACTGCCCGCG GGGGTCCGGT CGCTGCCGCT CCCCCGCCTC GGGTGA
|
Protein sequence | MSASVPRLVA ARTLLGPAAD ADAAPAAGVR GIASVLAHRL ATDPARPLVT FYDDATGERV EFSATTFDNW VAKTANMLVD TLGLGIGDRV GVHLPTHWLS SVILLATWSA GMDAVLVRDA AGEQDGGDAE KAGGELPVAT PLDALFVAED RLDEAFGLMV DEIVALSLRP LGGRMRRPVA GVLDYAAEVP PHGDRFAAPA APPGQAALLR AGTAIAGAWG LGPADRVLFT APLATTEGLV GSLLAPLVAG SSIVLCRHLD SAALPRRIET EKITAVGRSV LRHAPSPLPA GVRSLPLPRL G
|
| |