Gene Franean1_1916 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_1916 
SymbolhisD 
ID5670317 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp2297016 
End bp2298311 
Gene Length1296 bp 
Protein Length431 aa 
Translation table11 
GC content76% 
IMG OID641240837 
Producthistidinol dehydrogenase 
Protein accessionYP_001506259 
Protein GI158313751 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0141] Histidinol dehydrogenase 
TIGRFAM ID[TIGR00069] histidinol dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.215288 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCTCAGAA GGCTCGATCT GCGCTCCGCG CAGCCCACCA TGTCGGACGT CCGTCGGCTC 
TTCCCGCGGG CGGCCGTGGA CGTCGACGTC GCGATGGACG CGGTCCGCCC CGTCTGCGAC
GACGTGCGCG ACCGTGGTGA CGCGGCCGTC CTCGACGCCG CGGAGCGCTT CGACCGGGTC
CGCCCGGCCG AGCTGCGGGT ACCGGCCGAC GCGCTCGCGA GCGCGCTCGC GGCCCTCGGC
CCGGCGGTCC GTGACGCGCT GACCGAGGCG ATCCGCCGCG CGCGGCTGGT GCACCGGGCC
CAGCTCCGGG AGCCGGTGGT CGTCGAGGTG GCCCCCGGCA CGAAGGTCAC CGAGCGCTGG
ATCCCGGTCG GCCGGGTCGG CCTCTACGTG CCGGGCGGGC GGGTCGCCTA CCCCAGCAGC
GTGGTCATGA ACGTCGTCCC CGCGCAGGAG GCCGGCGTCG CCTCGCTGGC GGTGACCTCG
CCGCCCCAGG TCGACAACGG CGGCCTGCCG CATCCGGTCG TACTGGCCGC CTGCGCCCTG
CTCGGGGTCG ACGAGGTCTA CGCGGCCGGC GGCGCCCAGG CCGTCGCGAT GTTCGCGCAC
GGCACCGAGA GCTGCCCGGC CGTCGATGTC GTCACCGGCC CCGGCAACGT CTACGTCACC
GCGGCGAAGC GGCTGCTGCG CGGGCTGGTC GGCGTCGACG CCGAGGCGGG CCCGACCGAG
GTCGCCATTC TCGCCGACGG CTCGGCCCGC CCCGACTTCG TCGCCGCCGA CCTGATCGCG
CAGGCCGAGC ACGACCCGAT GGCCGCCTGC CTGCTGGTCA CGACGTCGCC AGAGCTGGCC
GACGCCGTCG ACGTCGAGCT CGACAAGCAG GTCCCCGCCA CCCGGCACCG GGAGCGGGTC
ACTGAGGCGC TGGCCGGCCA GGGCGCCGTG GCGATCGTCG CCGACGTCGA CGCGGGTCTC
GCGGTCGTCG ACGCCTGGGC CGCCGAGCAC CTGGAGATCC ACACCGCGGA CGCGGCGGGT
GTCGCCGCCC GGGTGCGCAA CGCGGGCGCG ATCTTCGTCG GCGCCTACGC GCCCGTGCCA
CTCGGGGACT ACCTCGCCGG CTCGAACCAC GTCCTGCCCA CCGGCGGCAC CGCGCGGCAC
TCCAGCGGCC TCGCCGTGTC CGCCTTCCAG CGCCAGGTCC ATGTCGTCGA GTGCGGCCCC
GAGGCGCTCG CCGACGTCGC GCCCCGCATC GCCGCGCTCG GCGGAGCCGA GGACCTGATC
GCCCACGTCG ACGCGGTGGA GGTACGGGCC CGATGA
 
Protein sequence
MLRRLDLRSA QPTMSDVRRL FPRAAVDVDV AMDAVRPVCD DVRDRGDAAV LDAAERFDRV 
RPAELRVPAD ALASALAALG PAVRDALTEA IRRARLVHRA QLREPVVVEV APGTKVTERW
IPVGRVGLYV PGGRVAYPSS VVMNVVPAQE AGVASLAVTS PPQVDNGGLP HPVVLAACAL
LGVDEVYAAG GAQAVAMFAH GTESCPAVDV VTGPGNVYVT AAKRLLRGLV GVDAEAGPTE
VAILADGSAR PDFVAADLIA QAEHDPMAAC LLVTTSPELA DAVDVELDKQ VPATRHRERV
TEALAGQGAV AIVADVDAGL AVVDAWAAEH LEIHTADAAG VAARVRNAGA IFVGAYAPVP
LGDYLAGSNH VLPTGGTARH SSGLAVSAFQ RQVHVVECGP EALADVAPRI AALGGAEDLI
AHVDAVEVRA R