Gene Information |
Locus tag | Francci3_3803 |
Symbol | |
ID | 3905551 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | - |
Start bp | 4560608 |
End bp | 4561780 |
Gene Length | 1173 bp |
Protein Length | 390 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637881129 |
Product | molybdopterin biosynthesis-like protein MoeZ |
Protein accession | YP_482882 |
Protein GI | 86742482 |
COG category | [H] Coenzyme transport and metabolism; [P] Inorganic ion transport and metabolism |
COG ID | [COG0476] Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 2; [COG0607] Rhodanese-related sulfurtransferase |
TIGRFAM ID | |
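
The coordinate and length fields in this record are mutually consistent, which can be checked with a short calculation. The sketch below assumes that Start bp and End bp are 1-based and inclusive (an assumption, not stated in the record) and that the CDS ends with a single stop codon that is not counted in the protein length.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4560608
end_bp = 4561780

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# An unspliced CDS is read in triplets; the final codon is the stop codon
# and contributes no amino acid to the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")      # compare with the Gene Length field
print(protein_length, "aa")   # compare with the Protein Length field
```

Under these assumptions the results match the Gene Length (1173 bp) and Protein Length (390 aa) fields.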
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.209325 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.526883 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCCCTTC CGCCCCTGGT CGACCCCGCC GAGGGGCTGA CCGTCGACGA GATCCGCCGG TACTCTCGGC ATCTGATCAT TCCGGATGTC GCCATGGACG GCCAGAAGCG GCTGAAGAAC GCCAGGGTGC TGGCGGTCGG TGCCGGTGGC CTCGGCTCGC CGACGCTGAT GTACCTGGCC GCCGCCGGCG TCGGGACGCT AGGCATCGTC GAGTTCGACA CCGTTGACGA GTCGAACCTG CAGCGTCAGA TCATCCACGG CCAGTCCGAC GTCGGCCGCT CGAAGGCGGA GTCGGCCCGT GACTCGGTTC GCAACATCAA CCCGTACGTG AACGTCGTCC TGCACGAGAC CCGGCTGGAC GCCTCCAACG TCATGGAGAT CTTCAGCGGG TACGACCTCA TCGTCGACGG CACGGACAAC TTCGCCACCC GTTACCTGGT CAACGACGCC GCGGTGCTGC TCGGCAAGCC CTACGTCTGG GGTTCGATCT ACCGCTTCGA CGGTCAGGCC AGCGTCTTCT GGGCCGAGCA CGGACCGTGC TACCGCTGCC TCTACCCGGA GCCGCCTCCT CCCGGCATGG TCCCCTCCTG CGCCGAGGGC GGGGTGCTGG GTGTGCTGTG CGCCTCCATC GCCTCCATCC AGACCACCGA GGCCATCAAG GTGCTGACCG GGGTCGGTGA TCCGCTGGTC GGTCGGCTGA TGGTGTATGA CGCCCTGGAG ATGACCTATC GGTCGATCAA GGTCCGCAAG GACCCGGAGT GCCCGTTGTG CGGGAAGAAC CCGACGATCA CCGAGCTGAT CGACTACGAG GCGTTCTGCG GGGCGGTCTC GGAGGAGGCG CAGTTGGCCG CCGCCGGCTC GACGATCACC GCGGGCGAGC TCAAGAGCTG GCTGGATGCC GGCGAGCCGA TCGAGCTCGT CGACGTCCGT GAGCCGGCCG AGTGGGAGAT CGTCCGGATC CCCGGCGCGC GCCTGATCCC CAAGGGGGAC CTGCCCGCGC ATCTCTCCGA ACTGCCGCAG CACCGTCGGG TGGTCGTCTA CTGCAAGTCC GGGGTGCGCT CGGCCGACGC GCTCGCCACG CTGAAGGGCG CAGGCTTCTC CTCTGCCGTG CACGTCCAGG GTGGCGTGAC CGCGTGGGCG ATCCAGGTCG ACAAGTCGCT GCCGGTCTAC TGA
|
Protein sequence | MSLPPLVDPA EGLTVDEIRR YSRHLIIPDV AMDGQKRLKN ARVLAVGAGG LGSPTLMYLA AAGVGTLGIV EFDTVDESNL QRQIIHGQSD VGRSKAESAR DSVRNINPYV NVVLHETRLD ASNVMEIFSG YDLIVDGTDN FATRYLVNDA AVLLGKPYVW GSIYRFDGQA SVFWAEHGPC YRCLYPEPPP PGMVPSCAEG GVLGVLCASI ASIQTTEAIK VLTGVGDPLV GRLMVYDALE MTYRSIKVRK DPECPLCGKN PTITELIDYE AFCGAVSEEA QLAAAGSTIT AGELKSWLDA GEPIELVDVR EPAEWEIVRI PGARLIPKGD LPAHLSELPQ HRRVVVYCKS GVRSADALAT LKGAGFSSAV HVQGGVTAWA IQVDKSLPVY
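
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal sketch of translation under translation table 11 (the bacterial, archaeal and plant plastid code). The helper names `translate_cds` and `gc_fraction` are hypothetical and introduced only for illustration. The codon assignments are those of the standard code, which table 11 shares; the start-codon set is the one listed for table 11, and an alternative start codon such as GTG is read as methionine at the first position. Only the first 30 bases of the gene sequence are used here.

```python
# Codon table in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

# Start codons accepted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a forward-strand CDS, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    usable = len(seq) - len(seq) % 3    # ignore any trailing partial codon
    protein = []
    for pos in range(0, usable, 3):
        codon = seq[pos:pos + 3]
        amino_acid = CODON_TABLE[codon]
        if pos == 0 and codon in START_CODONS:
            amino_acid = "M"            # initiator codon is read as Met
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a sequence, ignoring whitespace."""
    seq = "".join(dna.split()).upper()
    return sum(base in "GC" for base in seq) / len(seq)


# First three 10-base blocks of the gene sequence in this record.
prefix = "GTGTCCCTTC CGCCCCTGGT CGACCCCGCC"
peptide = translate_cds(prefix)
print(peptide)  # compare with the first 10 residues of the protein sequence
```

The printed peptide corresponds to the first block of the protein sequence, MSLPPLVDPA. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field.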