Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_1933 |
Symbol | trpA |
ID | 5670334 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 2316180 |
End bp | 2317100 |
Gene Length | 921 bp |
Protein Length | 306 aa |
Translation table | 11 |
GC content | 78% |
IMG OID | 641240854 |
Product | tryptophan synthase subunit alpha |
Protein accession | YP_001506276 |
Protein GI | 158313768 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0159] Tryptophan synthase alpha chain |
TIGRFAM ID | [TIGR00262] tryptophan synthase, alpha subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0945719 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.868829 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCGCACG ACCAGTCGGC CCAGCAGGCA TGGGCCCAGG AGCAGCCGGC GCGCGACCAG CCGGCGCCGA GCCGGCTCGA GCCGTCCGGG CAGCGGCCGG CCCGGCGCGG CGGGCCGAGC CCGCTGGACG AGGCCTTCGC GGCCGCGCGC AAGGACGGGC GGGCGGTGCT CGTCGGCTAT CTCCCCGCCG GGTTCCCGAC GGTGGACCGC GGCATCGCGG CGATGCGGGC GATGGTCGCG GCGGGCGTGG ACGTCGTCGA GGTCGGCCTG CCCTACTCGG ATCCGACGAT GGACGGCCCG GTCATCCAGG ACGCCGCGGA CACCGCGCTG CGCGGCGGCG TGACCACCAG GGACGTGCTG CGCACGGTCG AGGCGGTCGC CGAGACCGGG GCCCCCACCC TGGTGATGAC CTACTGGAAC CCGGTGGAGC GGTACGGCAT GGAGGCGTTC GCCGCCGACC TGGCCGCCGC CGGCGGGGCC GGGGCGATCA CCCCCGACCT GCCGCCGGAG GAGGCCGGCC CGTGGCTCGC GGCCAGCGCC ACCCACGGCC TCGACCCGGT CTTCCTGGTC GCGCCGAGCT CGACCACCGA ACGGCTGCGC CTGGTGACGG CGCACAGCGG CGGCTTCGTC TACGCGGCGT CGACCATGGG CGTCACCGGT GCGCGCGCCG CCGTCGGTGT GAAGGCGGCC GGCCTGGTCG CCCGGGTCCG GGAGGTGACC GACCTGCCTG TGGCGGTTGG CCTCGGCGTC AGCACCGGTG CTCAGGCGTC CGAGGTGGCC GGCTTCGCCG ACGGCGTCAT CGTGGGCTCG GCGCTGGTCC GGGCCCTGGC GGCCGACGCG CGGGACGGCG CCGACGGCGT CGGTGCGATC GAGCGGCTGG CGGCTGAGCT CGCCGCCGGC GTGCGTTCGG CCACCGCCTG A
|
Protein sequence | MAHDQSAQQA WAQEQPARDQ PAPSRLEPSG QRPARRGGPS PLDEAFAAAR KDGRAVLVGY LPAGFPTVDR GIAAMRAMVA AGVDVVEVGL PYSDPTMDGP VIQDAADTAL RGGVTTRDVL RTVEAVAETG APTLVMTYWN PVERYGMEAF AADLAAAGGA GAITPDLPPE EAGPWLAASA THGLDPVFLV APSSTTERLR LVTAHSGGFV YAASTMGVTG ARAAVGVKAA GLVARVREVT DLPVAVGLGV STGAQASEVA GFADGVIVGS ALVRALAADA RDGADGVGAI ERLAAELAAG VRSATA
|
| |