Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_1917 |
Symbol | |
ID | 5670318 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 2298308 |
End bp | 2299669 |
Gene Length | 1362 bp |
Protein Length | 453 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641240838 |
Product | histidinol-phosphate aminotransferase |
Protein accession | YP_001506260 |
Protein GI | 158313752 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase |
TIGRFAM ID | [TIGR01141] histidinol-phosphate aminotransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0201149 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGGGA ACATCAGCGG AGCGGGTGAC GGCGCCCCCT CGGGCGACAG CCCCGGCCTG GGCGACAGCC CCGGCCTGGG TGACGGCGCC GGCCTGGGTG ACGGCGCAGG TCCCAGCGGG GTGCTCCCGG GGGCGCCGGC CGGCCAGGGC GCGCCCGTCG GTGGCCGCGG GCGGCTGCGC TCCGCTGCGG CCACCCCGGC CGACCGTGCC CCGTGGCGGC CCGTCACCCT GGACGACCTG CCCCTGCGTG ACGACCTGCG GGGGATGTCG CCCTACGGCG CGCCCCAGAT CGACGTCCCG GTGCGGCTGA ACACCAACGA GAACCCGCAC CCGCCGTCCG CCGGGCTCGT GGACGCGCTC GGCAAGGCCG CGACCCTCGC CGCGACCGAG GCCAACCGCT ATCCCGACCG GGAGGCCGAG GCTCTGCGCG CCGATCTGGC GTACTACCTG ACCCCGGACG CCGGTTTCGG CGTGCACGCG GCCCAGGTGT GGGCGGCCAA CGGGTCGAAC GAGATCCTCC AGCAGCTCTG CCAGGCGTTC GGCGGTCCGG GGCGGGTGGC GGTGGGCTTC GAGCCGTCCT ACTCGATGCA CCGGCTGATC GCGCTGGCGA CCGCCACCGG CTGGGTCGCC GAGTCCCGCG CGGCGGACTT CACCCTCGAC GCCGACCGGG TCACCGCCGC GATCCGCCGG TACCGCCCGG CGCTGCTGTT CCTGTGCTCG CCGAACAACC CCACCGCCAC CGCGCTCGGC GCGGAGGTCA TCGCGGCCGC CTGCGACGCC ATGGCCGAGG TCGGCTCGGG TGTCGTCGTG GTCGACGAGG CCTACGGCGA GTTCCGCCGG GCCGGCGTCC CCAGCGCGCT CACCCTGCTG CCCGACCACC CTCGGCTGGT CGTCACCCGG ACGATGAGCA AGGCGTTCGC GTTGGCCGGC GCCAGGGTCG GCTACCTCGC GGCGCATCCG GCGGTTGTCG ACGCGCTGCA GCTCGTCCGC CTGCCCTACC ACCTGTCGTC GTTCACCCAG GCGGTCGCGC GCACCGCGCT CGCCCACGCC GACGAGCTGC TCGGCACAGT GGACGCGGTG AAGGCACAGC GCGACCTGCT CGTCCGGTCC CTGCCGGAGT TCGGCTGCGT GACCGCTCCG AGCGACGCCA ACTTCGTGCT GTTCGGCCAC TTCACCGACC AGCGCGCCGT GTGGCAGGGC CTGCTCGACG CCGGCGTGCT CGTCCGCGAC GTCGGCCTCG ACGGCTGGCT GCGGGTGACG GCCGGCCTGC CGAACGAGAC AGAGTCCTTC CTCGACGCGC TGCGCCGGGT GCTGACCGCC CGCCCGGCCC TCCTGCGCGC CGCCGCGGAG ATCAGCTCCT GA
|
Protein sequence | MTGNISGAGD GAPSGDSPGL GDSPGLGDGA GLGDGAGPSG VLPGAPAGQG APVGGRGRLR SAAATPADRA PWRPVTLDDL PLRDDLRGMS PYGAPQIDVP VRLNTNENPH PPSAGLVDAL GKAATLAATE ANRYPDREAE ALRADLAYYL TPDAGFGVHA AQVWAANGSN EILQQLCQAF GGPGRVAVGF EPSYSMHRLI ALATATGWVA ESRAADFTLD ADRVTAAIRR YRPALLFLCS PNNPTATALG AEVIAAACDA MAEVGSGVVV VDEAYGEFRR AGVPSALTLL PDHPRLVVTR TMSKAFALAG ARVGYLAAHP AVVDALQLVR LPYHLSSFTQ AVARTALAHA DELLGTVDAV KAQRDLLVRS LPEFGCVTAP SDANFVLFGH FTDQRAVWQG LLDAGVLVRD VGLDGWLRVT AGLPNETESF LDALRRVLTA RPALLRAAAE ISS
|
| |