Gene Franean1_1917 details

Gene Information

Locus tag: Franean1_1917
Symbol:
ID: 5670318
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 2298308
End bp: 2299669
Gene Length: 1362 bp
Protein Length: 453 aa
Translation table: 11
GC content: 75%
IMG OID: 641240838
Product: histidinol-phosphate aminotransferase
Protein accession: YP_001506260
Protein GI: 158313752
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase
TIGRFAM ID: [TIGR01141] histidinol-phosphate aminotransferase
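
The coordinate and length fields in this record agree with one another, and a short check makes the arithmetic explicit. This is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the last codon of the gene is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2298308
end_bp = 2299669

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values match the listed Gene Length (1362 bp) and Protein Length (453 aa).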


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0201149
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGGGA ACATCAGCGG AGCGGGTGAC GGCGCCCCCT CGGGCGACAG CCCCGGCCTG 
GGCGACAGCC CCGGCCTGGG TGACGGCGCC GGCCTGGGTG ACGGCGCAGG TCCCAGCGGG
GTGCTCCCGG GGGCGCCGGC CGGCCAGGGC GCGCCCGTCG GTGGCCGCGG GCGGCTGCGC
TCCGCTGCGG CCACCCCGGC CGACCGTGCC CCGTGGCGGC CCGTCACCCT GGACGACCTG
CCCCTGCGTG ACGACCTGCG GGGGATGTCG CCCTACGGCG CGCCCCAGAT CGACGTCCCG
GTGCGGCTGA ACACCAACGA GAACCCGCAC CCGCCGTCCG CCGGGCTCGT GGACGCGCTC
GGCAAGGCCG CGACCCTCGC CGCGACCGAG GCCAACCGCT ATCCCGACCG GGAGGCCGAG
GCTCTGCGCG CCGATCTGGC GTACTACCTG ACCCCGGACG CCGGTTTCGG CGTGCACGCG
GCCCAGGTGT GGGCGGCCAA CGGGTCGAAC GAGATCCTCC AGCAGCTCTG CCAGGCGTTC
GGCGGTCCGG GGCGGGTGGC GGTGGGCTTC GAGCCGTCCT ACTCGATGCA CCGGCTGATC
GCGCTGGCGA CCGCCACCGG CTGGGTCGCC GAGTCCCGCG CGGCGGACTT CACCCTCGAC
GCCGACCGGG TCACCGCCGC GATCCGCCGG TACCGCCCGG CGCTGCTGTT CCTGTGCTCG
CCGAACAACC CCACCGCCAC CGCGCTCGGC GCGGAGGTCA TCGCGGCCGC CTGCGACGCC
ATGGCCGAGG TCGGCTCGGG TGTCGTCGTG GTCGACGAGG CCTACGGCGA GTTCCGCCGG
GCCGGCGTCC CCAGCGCGCT CACCCTGCTG CCCGACCACC CTCGGCTGGT CGTCACCCGG
ACGATGAGCA AGGCGTTCGC GTTGGCCGGC GCCAGGGTCG GCTACCTCGC GGCGCATCCG
GCGGTTGTCG ACGCGCTGCA GCTCGTCCGC CTGCCCTACC ACCTGTCGTC GTTCACCCAG
GCGGTCGCGC GCACCGCGCT CGCCCACGCC GACGAGCTGC TCGGCACAGT GGACGCGGTG
AAGGCACAGC GCGACCTGCT CGTCCGGTCC CTGCCGGAGT TCGGCTGCGT GACCGCTCCG
AGCGACGCCA ACTTCGTGCT GTTCGGCCAC TTCACCGACC AGCGCGCCGT GTGGCAGGGC
CTGCTCGACG CCGGCGTGCT CGTCCGCGAC GTCGGCCTCG ACGGCTGGCT GCGGGTGACG
GCCGGCCTGC CGAACGAGAC AGAGTCCTTC CTCGACGCGC TGCGCCGGGT GCTGACCGCC
CGCCCGGCCC TCCTGCGCGC CGCCGCGGAG ATCAGCTCCT GA
 
Protein sequence
MTGNISGAGD GAPSGDSPGL GDSPGLGDGA GLGDGAGPSG VLPGAPAGQG APVGGRGRLR 
SAAATPADRA PWRPVTLDDL PLRDDLRGMS PYGAPQIDVP VRLNTNENPH PPSAGLVDAL
GKAATLAATE ANRYPDREAE ALRADLAYYL TPDAGFGVHA AQVWAANGSN EILQQLCQAF
GGPGRVAVGF EPSYSMHRLI ALATATGWVA ESRAADFTLD ADRVTAAIRR YRPALLFLCS
PNNPTATALG AEVIAAACDA MAEVGSGVVV VDEAYGEFRR AGVPSALTLL PDHPRLVVTR
TMSKAFALAG ARVGYLAAHP AVVDALQLVR LPYHLSSFTQ AVARTALAHA DELLGTVDAV
KAQRDLLVRS LPEFGCVTAP SDANFVLFGH FTDQRAVWQG LLDAGVLVRD VGLDGWLRVT
AGLPNETESF LDALRRVLTA RPALLRAAAE ISS
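
The relationship between the two sequences can be spot-checked by translating a few codons by hand. The sketch below is only an illustration: it builds the standard codon-to-amino-acid assignments, which translation table 11 shares for elongation, and translates the first and last codons of the gene sequence. The names `CODON_TABLE` and `translate` are hypothetical helpers, not part of any library.

```python
# Codon assignments in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((x, y, z) for x in BASES for y in BASES for z in BASES),
        AMINO_ACIDS,
    )
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence in this record.
first_codons = "ATGACCGGGA ACATCAGCGG AGCGGGTGAC"
last_codons = "ATCAGCTCCTGA"

print(translate(first_codons))
print(translate(last_codons))
```

The two results correspond to the first ten residues and the last three residues of the protein sequence listed in this record.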