Gene Information |
Locus tag | Franean1_4076 |
Symbol | |
ID | 5672434 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 4858159 |
End bp | 4859244 |
Gene Length | 1086 bp |
Protein Length | 361 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 641242952 |
Product | threonine aldolase |
Protein accession | YP_001508369 |
Protein GI | 158315861 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2008] Threonine aldolase |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0000453005 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGCGTCC AGCCCCCGCG AACACGGCCC GCCGGCCTGC CCGTCGACCT GCGCAGCGAC ACCGTCACCC GGCCCACGCC CGGCATGCGC CGCGCGATGG CGGAGGCGGA GGTCGGCGAC GACGTCTACC GCGAGGACCC GACCGTCCGC GAGCTCGAGG AGCATGCCGC CGCGCTGCTC GGGCACGAGG CCGCGCTGTT CGTGCCGAGC GGCACCATGG GCAACTTCTG TGCCCTGCGC GCCGGCGCCC CGGTGGGCAC CGAGGTCGTC GCCGACACCG ACGCGCACAT CGTCACCTAC GAGCTCGGCG GGCTCGCCGC CCTCGGCGGC GTGCAGACCC GGACCCTCAG CGGCCTCGCG GACACGCTCG ACCCGGCCGA CATCGCGGCC CAGCTGCGCG CGTTCCCGGT CGCGCACAAC TACAACATGG TCCGGACGAG CGTGCTCGCG GTGGAGAACA CCCGGGCCCG GGCCGGTGGC CGGGTGTGGC CCCTGGAGCG GCTCGACCGG CTGCGGGTGA TCACCGAGGC CGCCGGGGTG GTCCTGCACT GCGACGGCGC CCGCCTCTGG AACGCGGCGG TGGCGCTCGA CGTCCCGCCG CGCCGCCTCG GCGAGATCTT CGGGACGCTG TCGGTCTGCC TCTCGAAGGG CCTCGGCGCC CCGGTCGGTT CCCTTGTCGT CGGCGGCGCC GAGCACGTCG AGCGGGCCCG CGAGTGGCGC AAGCGGCTCG GCGGTGGGAT GCGCCAGGCC GGGGTGCTCG CCGCCGCCGG CCTGTACGCG CTGCGGCACC ACCTCGACCG CCTCGCCGAC GACCACCGCC ACGCCGCCGC GCTGGCCGCG ACCCTCGCCG ACGCGGCACC GCGGCGGGTC CACCCGGAGC GCACCGAGAC GAACATGGTG CTCGTCGACG TCCCGGACGC GGCCGCCTTC TGCGCGCAGG CGGCGGACGG CGGTGTGCTC GTCGGCCTGG CCGGTCCGAC GACGGTCCGG ATCGTCACCC ACCTCGACGT CGACGACACC GCGATCCGCC GGGCCGGGGA CGTCCTCGCC CCGCTGCTGA ACTCCCTGCC ACCAGCCGGT TCCTGA
|
Protein sequence | MSVQPPRTRP AGLPVDLRSD TVTRPTPGMR RAMAEAEVGD DVYREDPTVR ELEEHAAALL GHEAALFVPS GTMGNFCALR AGAPVGTEVV ADTDAHIVTY ELGGLAALGG VQTRTLSGLA DTLDPADIAA QLRAFPVAHN YNMVRTSVLA VENTRARAGG RVWPLERLDR LRVITEAAGV VLHCDGARLW NAAVALDVPP RRLGEIFGTL SVCLSKGLGA PVGSLVVGGA EHVERAREWR KRLGGGMRQA GVLAAAGLYA LRHHLDRLAD DHRHAAALAA TLADAAPRRV HPERTETNMV LVDVPDAAAF CAQAADGGVL VGLAGPTTVR IVTHLDVDDT AIRRAGDVLA PLLNSLPPAG S
|
| |
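The coordinate and length fields in this record are related by simple arithmetic. As a minimal sketch, the Python below checks that relationship. It assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length (the gene sequence above ends in TGA). The function names `gene_length` and `protein_length` are hypothetical helpers written for this illustration and are not part of any database API.

```python
# Values copied from the record above.
START_BP = 4858159
END_BP = 4859244


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS.

    Assumes the CDS length is a multiple of three and, by default,
    that the final codon is a stop codon that encodes no residue.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = cds_length_bp // 3
    return codons - 1 if has_stop_codon else codons


if __name__ == "__main__":
    length_bp = gene_length(START_BP, END_BP)
    print("Gene length (bp):", length_bp)
    print("Protein length (aa):", protein_length(length_bp))
```

Under those assumptions the computed values agree with the Gene Length (1086 bp) and Protein Length (361 aa) fields. The gene sequence starts with GTG while the protein starts with M, which is expected: GTG is an alternative start codon under translation table 11, and a start codon is translated as methionine.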