Gene Snas_3997 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSnas_3997 
Symbol 
ID8885198 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStackebrandtia nassauensis DSM 44728 
KingdomBacteria 
Replicon accessionNC_013947 
Strand
Start bp4265226 
End bp4266515 
Gene Length1290 bp 
Protein Length429 aa 
Translation table11 
GC content72% 
IMG OID 
Producthistidinol dehydrogenase 
Protein accessionYP_003512742 
Protein GI291301464 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.327583 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.821981 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCTGAAAC GCATCGATCT ACGAGGGTCC ACCGACATTC CCGCGGGGTT GCTTCCCCGG 
GCGCGCTTCG ACGTCGCCGC CGCGCTGGCC GAGATCCAGC CGCTGCTGGA CGCCATCGCG
CATCACGGTG CCGACCCGAT CCGACGCGTG ACGGCCGAAC GCGACGGCGT CGAGCTGGAC
AGCCTGCGGG TACCCGCCGA GGCCCTGCGC GACGCACTGG ACTCATTGGA CCCCGCGCTG
CGGGGTTCGC TGGAGGAGGC CGTTTCGCGG GTGCGCGCGG TGCACTCCGC CCAGCGTCGC
GAGAACCTCG TCACCGAGGT CGCTCCGGGC GGCGTGGTCA CCGAGCGGTT CGTGCCGGTG
CGGCGCGTCG GGCTCTACGT CCCCGGCGGG CTGGCACCGT TGGCGTCCAG TGTGGTCATG
AACGTGGTGC CGGCGCAGCT GGCCGGGGTG CCGCAGATCG CGGTCGCCTC GCCGCCGCAG
CGCGCCACCG GACTGCCCGA CGTCACGATC CTGGCCGTGT GCGCGATGCT GGACGTCACC
GAGGTCTACG CCGTCGGCGG TCCCGCCGCC ATCGGCATGT TCGCCTACGG CGCCGACGAA
TGCGCGCCGG TCGACATGAT CACCGGCCCG GGCAACATCT ACGTCACCGC CGCCAAACGC
GCGGTGCGGG GCCTGGTCGG CATCGACGCC GAGGCCGGAA CCACCGAGAT CGCGGTGCTG
GCCGACGACA CCGCCGACGC CGCCCACGTG GCCGCCGACC TGATCAGCCA GGCCGAACAC
GACCCCGAGG CCGCCAGCGT GCTCGTCACG CCCTCGGCGG CCCTCGCCGA CGCCGTCGAC
GCCGAAGTCA AGGCCATGGT CGACGAAGCC CGTCACGCGC AACGGATCCG GATCGCGCTG
TCGGGCCCGC AGTCGGGCAT CGTCCTGGTC GACGACCTCG AACAAGGACT GTCGGTAGTG
GACGCCTACG CGGCCGAGCA CCTGGAGATC CAGACAGAGG GCGCCCGCGA ACTGGCGATG
CGGGTCACCA ACGCCGGAGC CATCTTCGTC GGCCCGTACT CGCCGGTCTC GCTCGGCGAC
TACTGCGCCG GGTCCAACCA CATCCTGCCC ACCGGCGGCT GCGCCCGGCA CTCGTCCGGC
CTGTCCGTCA CGAGCTTCCT GCGGCCGATC CAGGTCATCG AGTACGACCG CGAGGCCCTG
GCAGCCGTCA GCGACGACGT GGTCCGGCTG GCCGAGGCCG AGAACCTGCC CTCGCACGGC
AAGGCGGTCA CGGCGAGGTT CGGCCGGTGA
 
Protein sequence
MLKRIDLRGS TDIPAGLLPR ARFDVAAALA EIQPLLDAIA HHGADPIRRV TAERDGVELD 
SLRVPAEALR DALDSLDPAL RGSLEEAVSR VRAVHSAQRR ENLVTEVAPG GVVTERFVPV
RRVGLYVPGG LAPLASSVVM NVVPAQLAGV PQIAVASPPQ RATGLPDVTI LAVCAMLDVT
EVYAVGGPAA IGMFAYGADE CAPVDMITGP GNIYVTAAKR AVRGLVGIDA EAGTTEIAVL
ADDTADAAHV AADLISQAEH DPEAASVLVT PSAALADAVD AEVKAMVDEA RHAQRIRIAL
SGPQSGIVLV DDLEQGLSVV DAYAAEHLEI QTEGARELAM RVTNAGAIFV GPYSPVSLGD
YCAGSNHILP TGGCARHSSG LSVTSFLRPI QVIEYDREAL AAVSDDVVRL AEAENLPSHG
KAVTARFGR