Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_3121 |
Symbol | |
ID | 5671499 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 3675017 |
End bp | 3676975 |
Gene Length | 1959 bp |
Protein Length | 652 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641242018 |
Product | thiamine pyrophosphate protein central region |
Protein accession | YP_001507438 |
Protein GI | 158314930 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG3962] Acetolactate synthase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGCGGCC AGCCCCGCGT CCACGGCCCC GACGAACGGC TGATCGACGT GCTGGCCGCG GTCCCCACCC CTGCTCCTCG GGAGGCACCG ATGAGACTCA CCATGGCACA GGCGCTCGTG CGCTTCCTTG CCAACCAGTA CTCCGAGCGA GACGGCGTAG AGCAGCGGCT CGTCCCCGGC ATGTGGGGCA TCTTCGGCCA CGGCAACGTC GCCGGCCTCG GGCAGGCGTT GTTACAGGCC GTCCGCACCG GCGAAGCGGA TCTGCCGTAC TACCTGGCCC GCAACGAGCA GGGCCAGGTC CACGCGGCCG CGGCGTTCGC CAAGATGCGC AACCGGCTGC AGGCCTTCGC GTGCACCGCG TCTACCGGCC CCGGTTCGAC GAACATGATC ACCGGCGCGG CGCTGGCGAC CACCAACCGA CTGCCCGTCC TGCTGCTGCC CAGCGACATG TTCGCGACCC GCTACCCGGA TCCGGTGCTG CAACAGCTCG AGGACACCCG CGGCGGCGAC GTGACCGTAA ACGACGCTTT CCGCCCGGTG TCGAAGTACT TCGACCGCAT CACCCGCCCC GAGCAAATGA TCCCAGCCGC TCTCGCTGCG ATGCGGGTGC TCACCGATCC GGTCGAGACC GGCGCAGTGA CGCTCGCGCT GCCGCAGGAC GTACAGGCCG AGGCCTACGA CTGGCCGGAG GACTTCTTCC GCCGGCGCGT GTGGCATGTC AGACGTCCGG CCCCTGAGCC GGAGGCGCTC GCCAGGGCCG TCGAGCTGCT GCGAACTGCC AGGTCACCGC TGATCGTCGC CGGTGGCGGT GTCGTGTACT CCGAGGGCGA GCGGGAACTG CGGGCCTTCG CCGAGATGAC CGGCATCCCG GTGGCGGACA CACACGCAGG GAAGGGAGCG GTGCCGTGGG ACCACCCGTG CGCGGTGGGT GGCATCGGCT CGACCGGCAC CTCCGCGGCC AATGCACTCG CCGCCGGGGC AGATGTCGTG CTGGGTATCG GCACCCGCTA CAGCGACTTT ACCACCGCAT CGCACACCGT CTTCAAGAAC CCTGACGTCA CGTTCGTGAA CCTCAACGTT GCCCCGTTGG ACGCAGCGAA GCACTCCGCG GAAATGCTGG TAGCGGACGC CAAACGCGGC ATCGTGGCAC TGCACAGGAC CCTGCGCGGC TGGCAGGTCG GCGACGCCTA CCGGTCCCGC ACGCGGACCC TGGCCGACGA CTGGAACCGC AGGGTCGATG CCTGCGTCAC CCCCGGCCAC GGCCCATATC CCGCCCAGAC GGAGATCCTT GGCGCACTCA ACAAGGCCCT CTGCGACCGG GACGTGGTGA TCAACGCAGC CGGGTCGATG CCCGGTGACC TGCAGCTGCT GTGGCGGGCG AGGGACCCCA AGGCCTACAA CGTTGAATAC GCCTATTCCT GCATGGGATA CGAGGTCGCC GCCGGAGTCG GGACGAAGAT GGCCGCACCC GACCGTGACG TCGTCGTCCT GGTCGGCGAT GGCTCCTACC TGATGATGGC GCAGGAAATC GTCACGATGG TCGCCGAGGG CCTCAAGGTG ATCATCGTAC TGGTGCAGAA CCACGGATTC GCCTCGATCG GCTCACTGTC GGAATCCCTG GGATCACAGC GATTCGGCAC ATCGTACCGC TACCGCGACA AGTACTCCGG CCTGCTCGAC GGCGCTTTGC TGCCGATCGA CCTCGCGGCC AACGCGGCCA GCCTCGGCGC GACTGTGATC CGGGCCGCCA CCGTCGTGGA GTTCACCACC GCGATCGCTG CGGCCAAAGC CAACACCACG ACTACCGTCG TTCACGTCGA AACTGACCTT TTTGGCCCCA ACCCGCCCAG CTCAGCCTGG TGGGACGTAC CGGTCTGCGA GGTCTCCGAG CTGGAATCAA CGCAGAAGGC CTACGAGACG TACTCTGCCG CCAAGAACAC GCAGCGGCAC TACCTGTAG
|
Protein sequence | MGGQPRVHGP DERLIDVLAA VPTPAPREAP MRLTMAQALV RFLANQYSER DGVEQRLVPG MWGIFGHGNV AGLGQALLQA VRTGEADLPY YLARNEQGQV HAAAAFAKMR NRLQAFACTA STGPGSTNMI TGAALATTNR LPVLLLPSDM FATRYPDPVL QQLEDTRGGD VTVNDAFRPV SKYFDRITRP EQMIPAALAA MRVLTDPVET GAVTLALPQD VQAEAYDWPE DFFRRRVWHV RRPAPEPEAL ARAVELLRTA RSPLIVAGGG VVYSEGEREL RAFAEMTGIP VADTHAGKGA VPWDHPCAVG GIGSTGTSAA NALAAGADVV LGIGTRYSDF TTASHTVFKN PDVTFVNLNV APLDAAKHSA EMLVADAKRG IVALHRTLRG WQVGDAYRSR TRTLADDWNR RVDACVTPGH GPYPAQTEIL GALNKALCDR DVVINAAGSM PGDLQLLWRA RDPKAYNVEY AYSCMGYEVA AGVGTKMAAP DRDVVVLVGD GSYLMMAQEI VTMVAEGLKV IIVLVQNHGF ASIGSLSESL GSQRFGTSYR YRDKYSGLLD GALLPIDLAA NAASLGATVI RAATVVEFTT AIAAAKANTT TTVVHVETDL FGPNPPSSAW WDVPVCEVSE LESTQKAYET YSAAKNTQRH YL
|
| |