Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_5609 |
Symbol | |
ID | 5675768 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 6805135 |
End bp | 6806955 |
Gene Length | 1821 bp |
Protein Length | 606 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641244462 |
Product | thiamine biosynthesis protein ThiC |
Protein accession | YP_001509866 |
Protein GI | 158317358 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0422] Thiamine biosynthesis protein ThiC |
TIGRFAM ID | [TIGR00190] thiamine biosynthesis protein ThiC |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.567176 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 2 |
Fosmid unclonability p-value | 0.000155826 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | GTGGTGCGTC AGATCCGCCC GACGGATTCC GACAACGCGG CGGCCGCGGC CTCCGGCACG GACGACGGCG CGGAGGGCAT CCTGGCCCGG CCGTTCCCCG GGAGCAGGAA GACCTACCTG GTCGGGTCCC GGCCCGACCT GCGGGTGCCG ATGCGCGAGG TCGCGCTGAG CACGGGCGAC AGCCTGGTCC TCTACGACAC CTCCGGGCCG TACACCGACC CGGGCGCCGG CGTCGACGTG CGGCGCGGCC TGCCCGCGCT GCGGGCCGGC TGGATCGCCG CCCGCGGCGA CACCGAGGAG ATCGAGGGGC GTGCCGTCCA GCCCGCCGAC GACGGGCGGC GCGGGCCTGA CGGGCAGGCC GGTGACACCT TTCTCGGCTC GGGGCAGGCT CCGCGGCGGG CTACCGCCGG CCGCGCGGTG ACCCAGCTCG CCTACGCCCG GCGGGGCGAG ATCACCGCCG AGATGGAGTT CGTCGCGCTG CGTGAGGGCC TCTCTCCCGA GCTGGTGCGC GACGAGATCG CCGCGGGCCG GGCGGTGCTG CCGGCGAACG TGAACCACCC CGAGAGCGAG CCGATGGCCA TCGGCCGCAA CCTGCTGGTG AAGATCAACG CGAACATCGG CAACTCCGCC GTCGCCTCCT CCATCGAGGA GGAGGTCGAG AAGATGGTGT GGTCGACCCG CTGGGGCGCC GACACCGTCA TGGACCTCTC GACCGGGCGG AACATCCACA CCACCCGCGA GTGGATCATC CGGAACTCCC CCGTCCCGAT CGGGACGGTG CCGATCTACC AGGCGCTGGA GAAGGTCGGC GGCAAGGCCG AGGAGCTGAC CTGGGAGGTC TACCGGGACA CGGTGATCGA GCAGGCCGAG CAGGGTGTGG ACTACATGAC CGTCCACGCC GGGGTGCTGC TGCGCTACGT GCCGATGACC GCGCGGCGCA AGACCGGCAT CGTCTCGCGC GGCGGCTCGA TCCTGGCGGC ATGGTGCCTC GCCCATCATG AGGAGAACTT CCTCTACACG AACTTCGCCG AGCTCTGCGA GATCCTGCGG GCCTACGACG TCACCTTCTC CCTCGGCGAC GGTCTGCGCC CCGGCTCCAT CTCCGACGCG AACGACGCCG CGCAGCTCGC CGAGCTCGCA ACACTGGGCG AGCTGACCTC GGTGGCCTGG GAGCACGACG TCCAGGTGAT GATCGAGGGG CCGGGCCACG TGCCCATGCA CAAGATCAAG GAGAACGTGG ACCTGCAGCG CGAGCTGTGC CACGACGCGC CGTTCTACAC CCTCGGCCCG CTCACCACCG ACATCGCGCC CGGCTACGAC CACATCACCT CGGCCATCGG CGCCGCGATG ATCGGCTGGT ACGGGACGGC GATGCTCTGC TACGTCACGC CCAAGGAGCA CCTGGGCCTG CCGGACCGCG AGGACGTCAA GGCCGGGGTC ATCGCCTACA AGATCGCGGC GCACGCCGCC GACCTGGCGA AGGGGCACCC GGGCTCCCAG GCCTGGGACG ACGCCCTCTC CGACGCGCGG TTCGACTTCC GCTGGGACGA CCAGTTCAAC CTCTCACTCG ACCCGGAGAC CGCCCGGGAG TACCACGACG AGACACTGCC CGCGGCGCCC GCCAAGTCCG CGCACTTCTG CTCGATGTGC GGCCCGCACT TCTGCTCGAT GCGGATCACG CAGGACGTCC GCAAGTACGC GGCCGACCAC GGCCTCGACA CCGACGAGGC CGTCCAGGCC GGATTGGCGG AGAAGTCCCG GGAGTTCGCC GAGCAGGGTG CCCGGATCTA CCTGCCGCTC GCCGACCGGC AGACCCCCTG A
|
Protein sequence | MVRQIRPTDS DNAAAAASGT DDGAEGILAR PFPGSRKTYL VGSRPDLRVP MREVALSTGD SLVLYDTSGP YTDPGAGVDV RRGLPALRAG WIAARGDTEE IEGRAVQPAD DGRRGPDGQA GDTFLGSGQA PRRATAGRAV TQLAYARRGE ITAEMEFVAL REGLSPELVR DEIAAGRAVL PANVNHPESE PMAIGRNLLV KINANIGNSA VASSIEEEVE KMVWSTRWGA DTVMDLSTGR NIHTTREWII RNSPVPIGTV PIYQALEKVG GKAEELTWEV YRDTVIEQAE QGVDYMTVHA GVLLRYVPMT ARRKTGIVSR GGSILAAWCL AHHEENFLYT NFAELCEILR AYDVTFSLGD GLRPGSISDA NDAAQLAELA TLGELTSVAW EHDVQVMIEG PGHVPMHKIK ENVDLQRELC HDAPFYTLGP LTTDIAPGYD HITSAIGAAM IGWYGTAMLC YVTPKEHLGL PDREDVKAGV IAYKIAAHAA DLAKGHPGSQ AWDDALSDAR FDFRWDDQFN LSLDPETARE YHDETLPAAP AKSAHFCSMC GPHFCSMRIT QDVRKYAADH GLDTDEAVQA GLAEKSREFA EQGARIYLPL ADRQTP
|
| |