Gene Francci3_3027 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3027 
SymbolhisD 
ID3904380 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp3592383 
End bp3593678 
Gene Length1296 bp 
Protein Length431 aa 
Translation table11 
GC content75% 
IMG OID637880347 
Producthistidinol dehydrogenase 
Protein accessionYP_482113 
Protein GI86741713 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0141] Histidinol dehydrogenase 
TIGRFAM ID[TIGR00069] histidinol dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0409486 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCTCAGAA GGCTTGATCT GCGTGGCGCG GTGCCTGCCG CAGCCGAGAT CCGACGGCTA 
CTGCCGCGGG CCGACGTCGA CGTCGACGTC GCCGTCGATG CGGTCCGGCC GGTCTGTGCG
GACGTGCGGG ACCGCGGCGA CGCGGCCGTC CTCGCCGCCG CGGAGCGGTT CGACGGGGTC
CGCCCGCCGA GCCTGCGGGT GCCCGCCCAG GTTCTCGCCG CCGCCGCGGA TGTCCTGGAT
CCGGCCGTGC GCGACGCGCT GACCGAGGCG ATCCGCCGGA CTCGGCTGGT GCACCGTGCG
CAGCTGCGGG AGCCGGTGGC GATCGAGGTC GCGCCCGGCA CCACGGTGAC GCAGCGCTGG
GTGCCGGTGG AGCGGGTCGG CCTGTACGTG CCGGGCGGGC GGGTCGCCTA CCCCAGCAGC
GTCGTGATGA ACGTCGTGCC CGCCCAGGAG GCCGGGGTCG GCTCGCTCGC GGTCTTCTCC
CCGCCGCAGA AGGACAACGG CGGGCTGCCG CATCCGGTGA TCCTCGCCGC CTGCGCGCTG
CTCGGTGTCG AGGAGGTCTA CGCGGCCGGC GGAGCCCAGG CCGTGGCGAT GGCCGCCTAC
GGGACCACGA GCTGCCCGCC GGTCGATGTG ATCACCGGGC CCGGCAACGT CTACGTCACG
GCGGCCAAGC GCCTGCTGCG TGGGGTCGTC GGGGTGGACG CCGAGGCGGG CCCGACCGAG
GTCGCGATCC TCGCCGACGC CACCGCGGAT CCGGCCTTCG TCGCAGCCGA CCTCATCGCC
CAGGCCGAGC ACGACCCGAT GGCCGCCTGC CTGCTGATCA CCCCGTCGGT GGAACTCGCC
GACGCGGTCG ACGTCGAGCT CGGCAAGCAG GTGCCCGCCA CCCGCCACCG GGAGCGGGTG
GCGGAGTCGC TCGCCGGCCA GGGGGCGGTC GCGCTCGTCG CGGACGTCGA CGCGGGTCTC
GCGGTGGCGG ACGGCTGGGC CGCCGAGCAT CTGGAGATCC ACACCGCGGA CGCCGCCGCG
GTCGCCGCCC GGGTGCGCCA CGCCGGAGCG GTCTTCGTCG GCGACTACGC TCCCGTCCCG
CTCGGCGACT ACCTGGCCGG GTCCAACCAC GTCCTGCCCA CCGGCGGCAC CGCCCGGCAT
TCCAGCGGTC TGGCGGTCAC CGCCTTCCAA CGTCAGGTCC AGGTGGTCGA GTGCACGCGG
GAGGGGCTCG CCGCCGTCGC CCCGCGTATC GCCGCGCTCG GCGGGGCGGA GGACCTCACA
GCTCACGTCG ACGCGGTGGA GGTTCGACTC CGATGA
 
Protein sequence
MLRRLDLRGA VPAAAEIRRL LPRADVDVDV AVDAVRPVCA DVRDRGDAAV LAAAERFDGV 
RPPSLRVPAQ VLAAAADVLD PAVRDALTEA IRRTRLVHRA QLREPVAIEV APGTTVTQRW
VPVERVGLYV PGGRVAYPSS VVMNVVPAQE AGVGSLAVFS PPQKDNGGLP HPVILAACAL
LGVEEVYAAG GAQAVAMAAY GTTSCPPVDV ITGPGNVYVT AAKRLLRGVV GVDAEAGPTE
VAILADATAD PAFVAADLIA QAEHDPMAAC LLITPSVELA DAVDVELGKQ VPATRHRERV
AESLAGQGAV ALVADVDAGL AVADGWAAEH LEIHTADAAA VAARVRHAGA VFVGDYAPVP
LGDYLAGSNH VLPTGGTARH SSGLAVTAFQ RQVQVVECTR EGLAAVAPRI AALGGAEDLT
AHVDAVEVRL R