Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_3186 |
Symbol | |
ID | 3903911 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | - |
Start bp | 3775035 |
End bp | 3776555 |
Gene Length | 1521 bp |
Protein Length | 506 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 637880510 |
Product | bifunctional 3,4-dihydroxy-2-butanone 4-phosphate synthase/GTP cyclohydrolase II protein |
Protein accession | YP_482272 |
Protein GI | 86741872 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0108] 3,4-dihydroxy-2-butanone 4-phosphate synthase [COG0807] GTP cyclohydrolase II |
TIGRFAM ID | [TIGR00505] GTP cyclohydrolase II [TIGR00506] 3,4-dihydroxy-2-butanone 4-phosphate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.000678178 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.119885 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGACAT CCGCGGCCGG CGCCTCCTCG ACGGCCACCC TTCCCGCCGG TGCCCTTCCC GCCGGCTACG CCGACGTGCG GTTCGCCGCC ATCGAGGACG CGCTCGCCGA GATCGCCGCC GGCCGCCCGC TCGTCGTCGT CGACGACGCC GACCGGGAGA ACGAGGGTGA TCTGATCTTC GCGGCGGAGG CCGCGACACC CGAGCTGGTC GCGTTCATGA CGCGGTACAC CTCCGGTGTG ATCTGCGTGC CGATGGACGC CGCCGACACC GACCGGCTCG AACTGGACCA GATGGTCCCG CACAACACGG AGCGGATGGG CACCGCGTTC ACCATCACCG TGGACGCCCG CGACGGCGTC TCCACCGGGA TCTCCGCCGC CGACCGGGCC CGCACCATCC GGTTGCTGGC CGACCCGCGG ACCACGCCCG GCGACCTGAG CCGGCCGGGC CACATCTTCC CGCTGCGGGC CAAGGACGGC GGGGTGCTGC GCCGCCCCGG CCACACCGAG GCCGCGGTCG ACCTGGCCCG GCTCGCCGGG CTGCGGCCGG CCGGTGCGAT CTGCGAGATC GTGAACGACG ACGGGACGAT GGCCCGGCTT CCCCGGCTCG CCGAGTTCGC CCGCGAGCAC GGGCTGCTGC TGATCTCCAT CGCCGATCTC ATCGCTTACC GTCGGCGGAC CGAGAAGCAG GTGGTGCGGG TCGCGGAGGC GTCGCTTCCG ACGCGTTACG GGGCGTTCCG GGCGGTCGGG TTCCGCAACA TCCTCGACGG CGTCGAACAC ATCGCGCTGA TCCGCGGCGA GCTCGGTGAC GGCGCGGACG TGCTGGTCCG GGTGCACAGC GAATGCCTCA CCGGGGACGT CTTCGGCTCC CGGCGCTGCG ACTGCGGAAC CCAGCTCGAC GCGGCACTGC GCATAATCGC GACCGAGGGA CGCGGAGTGG TGCTCTACAT GCGGGGCCAC GAGGGCCGGG GCATCGGCCT GATGCACAAG CTGCGGGCCT ACCAGCTGCA GGACGCCGGC CACGACACCG TCGACGCCAA CCTCGCGCTG GGTCTGCCGG CCGACGCCCG TGACTACGGC ACCGGGGCCC AGATCCTGGT CGACCTCGGC GTGCGCGGCA TCCGGCTGCT GTCCAACAAT CCGACGAAGC GGGCCGGGCT GGAGGGCTAC GGCCTGCGCA TCGTCGAACT CATCGGGATG CCGGTCACCC AGACGCCGGA GAACCTGCGC TACCTGACCA CCAAGCGGGA CCGGATGGGA CACGTCATTC CCGGTCTGCC CAGCATTCCC GGTCTGCCCA GCATTCCCGG TCCGCCCGGC CTTCCGGGTC CGCCCGGCCT TCCGGGTCCA CCCGGCCTGT CCGATCTGCC TAGGGCCGTG GGCGAGGACG AGCCGGCCGG CTCGCGCTGG ATCCCGGGGG CGCCGAGGGC GCCGAGGGTA CCCGTCACCT CCGGCTGCAC GGCCGTCGCG GGCCCGGGTA CGTCCGCCGG GGTCGACTGT GCCGAGGGGG AGGGTCTGTG A
|
Protein sequence | MTTSAAGASS TATLPAGALP AGYADVRFAA IEDALAEIAA GRPLVVVDDA DRENEGDLIF AAEAATPELV AFMTRYTSGV ICVPMDAADT DRLELDQMVP HNTERMGTAF TITVDARDGV STGISAADRA RTIRLLADPR TTPGDLSRPG HIFPLRAKDG GVLRRPGHTE AAVDLARLAG LRPAGAICEI VNDDGTMARL PRLAEFAREH GLLLISIADL IAYRRRTEKQ VVRVAEASLP TRYGAFRAVG FRNILDGVEH IALIRGELGD GADVLVRVHS ECLTGDVFGS RRCDCGTQLD AALRIIATEG RGVVLYMRGH EGRGIGLMHK LRAYQLQDAG HDTVDANLAL GLPADARDYG TGAQILVDLG VRGIRLLSNN PTKRAGLEGY GLRIVELIGM PVTQTPENLR YLTTKRDRMG HVIPGLPSIP GLPSIPGPPG LPGPPGLPGP PGLSDLPRAV GEDEPAGSRW IPGAPRAPRV PVTSGCTAVA GPGTSAGVDC AEGEGL
|
| |