Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_0383 |
Symbol | |
ID | 3903435 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 456092 |
End bp | 458050 |
Gene Length | 1959 bp |
Protein Length | 652 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 637877712 |
Product | thiamine biosynthesis protein ThiC |
Protein accession | YP_479499 |
Protein GI | 86739099 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0422] Thiamine biosynthesis protein ThiC |
TIGRFAM ID | [TIGR00190] thiamine biosynthesis protein ThiC |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTGTCGC GTACCGACCG TTCGTCCTCG TCCACCTCGA AGGCCGTCAC CTCGTCCCCG TCCACCTCAT CCTTGTCGTC CGCCGCATCC TCCCCGTCCG TCTCGTCCTC GTCCTCCTCG TCGTCCGTCT CGGCCGCGGG GATGACGGCG GTCTCGACGG GTCCGCTGAC GGGCAGCCGC AAGACCTGGC TGGTCGGAGC GGATCCCGAT CTGCGGGTGC CGATGCGGGA GATCGTGCTG ACCACGGGTG ACACCGTCGT GGTGTACGAC ACCTCCGGTC CATATACCGA TCCGGGGGTG ACGATCGATG TGCGCCGGGG TCTGCCGGCG ACGCGGGACA GCTGGATCGC CCAGCGTGGC GACACCGCGC CGGATGAGCG GCGGACCGTT CCCGGGACCG GGGCATCGGG TCCTGGGACG CTCGGTTCTG GGACGCCCGG TTCTGGGACG CCCGGTTCTG GGCCGCTCGG TCTCGGCGGG ACGGATCTCG ACGGGCGGGT GCGAGTGCCC CGTCGGGCGG TGCCGGGACG GCCTTCCATA ACCCAGCTGG GCTACGCCCG CCGCGGTCAG ATCACTCGGG AGATGGAGTT CGTCGCCCTG CGGGAGGGTC TTCCCGTCGA GACGGTGCGG GCGGAGATCG CCGCGGGGCG GGCCGTGCTG CCGGCGAACG TGAACCATCC CGAGTCCGAG CCGATGGCGA TCGGCCGTGC GTTTCTCGTG AAGATCAATG CGAACCTCGG CAACTCGGCC GTTACCTCCT CGATCGAGGA GGAGGTCGAG AAGATGGTGT GGGCGACCCG CTGGGGCGCG GACACCGTGA TGGACCTCTC GACCGGGTCG GACATCGCCC TGACCCGTGA GTGGATCATC CGTAACGCGC CGGTGCCGGT CGGGACCGTG CCGATCTACC AGGCGTTGGA GAAGGTCGGT GGCCGGCCGG AGAAGCTGTC CTGGGAGGTC TACCGGGACA CCGTGATCGA GCAGTGCGAG CAGGGTGTGG ACTACATGAC GGTCCACGCG GGGGTGCTGC TGCGCTACGT GCCGCTGACC GCGCGGCGCA GGACCGGGAT CGTCTCGCGC GGCGGCTCGA TCCTGGCCTC CTGGTGCCTG GCCCATCACG AGGAGAACTT CCTCTACACC CACTTCGCCG AGCTGTGCGA GATCTTCGCC GCGTACGACG TCACGTTCTC TCTGGGCGAC GGCCTGCGGC CCGGGTCCAT CGCGGACGCG AACGATGAGG CCCAGCTCGC CGAGCTCGCC ACCCTGGGCG AGTTGACGCA GGTGGCGTGG GAGCACGACG TCCAGGTGAT GATCGAGGGA CCCGGGCACG TGCCCATGAA CAAGATCGAG GAGAACGTGC AGCTGCAGCG GGAGCTGTGC CACGACGCGC CGTTCTACAC CCTCGGGCCG CTGACCACCG ACATCGCCCC CGGCTACGAC CACATCACCT CCGCGATCGG GGCGGCGATG ATCGGATGGG CCGGTACCGC CATGCTCTGT TACGTCACCC CGAAGGAGCA TCTCGGCCTG CCCGACCGGG ACGACGTCAA GGCCGGCGTT ATCGCCTACA AGATCGCCGC GCACGCCGCC GACCTCGCTA AGGGGCATCC CGGCGCGCAG GCCTGGGATG ACGCCCTGTC GGACGCCCGG TTCGAATTCC GCTGGGCCGA CCAGTTCCAC CTCGCGCTCG ACCCCGACAC CGCACGCGCG TTCCACGACG AGACGCTGCC GGCCCCGGCC GCGAAGTCGG CGCACTTCTG TTCGATGTGC GGCCCGCACT TCTGCTCGAT GAAGATTTCC CACCAGGTGC GGGCACACGC GGGCGGGGAC GGGCTGGACC CGGCCGGGCA TGGCGCGGAC CCGGCCGGCG ACGAGGCGGT CACCGCCGGC CTGCGCGAGA AGGCCGCCGA GTTCAACGCC GCCGGCAATC GGATCTACCT TCCGGTGGCC AACTCCTGA
|
Protein sequence | MVSRTDRSSS STSKAVTSSP STSSLSSAAS SPSVSSSSSS SSVSAAGMTA VSTGPLTGSR KTWLVGADPD LRVPMREIVL TTGDTVVVYD TSGPYTDPGV TIDVRRGLPA TRDSWIAQRG DTAPDERRTV PGTGASGPGT LGSGTPGSGT PGSGPLGLGG TDLDGRVRVP RRAVPGRPSI TQLGYARRGQ ITREMEFVAL REGLPVETVR AEIAAGRAVL PANVNHPESE PMAIGRAFLV KINANLGNSA VTSSIEEEVE KMVWATRWGA DTVMDLSTGS DIALTREWII RNAPVPVGTV PIYQALEKVG GRPEKLSWEV YRDTVIEQCE QGVDYMTVHA GVLLRYVPLT ARRRTGIVSR GGSILASWCL AHHEENFLYT HFAELCEIFA AYDVTFSLGD GLRPGSIADA NDEAQLAELA TLGELTQVAW EHDVQVMIEG PGHVPMNKIE ENVQLQRELC HDAPFYTLGP LTTDIAPGYD HITSAIGAAM IGWAGTAMLC YVTPKEHLGL PDRDDVKAGV IAYKIAAHAA DLAKGHPGAQ AWDDALSDAR FEFRWADQFH LALDPDTARA FHDETLPAPA AKSAHFCSMC GPHFCSMKIS HQVRAHAGGD GLDPAGHGAD PAGDEAVTAG LREKAAEFNA AGNRIYLPVA NS
|
| |