Gene Phep_3769 details


Gene Information

Locus tag: Phep_3769
Symbol:
ID: 8254901
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pedobacter heparinus DSM 2366
Kingdom: Bacteria
Replicon accession: NC_013061
Strand:
Start bp: 4514835
End bp: 4515860
Gene Length: 1026 bp
Protein Length: 341 aa
Translation table: 11
GC content: 43%
IMG OID: 644937431
Product: Alcohol dehydrogenase GroES domain protein
Protein accession: YP_003094022
Protein GI: 255533650
COG category: [E] Amino acid transport and metabolism
              [R] General function prediction only
COG ID: [COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases
TIGRFAM ID: [TIGR01202] 2-desacetyl-2-hydroxyethyl bacteriochlorophyllide A dehydrogenase
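
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, not part of the original record: it assumes the start and end positions are 1-based and inclusive, and that the CDS is complete and ends in a stop codon (the record lists the gene as unspliced and not a pseudogene). The function names `span_length`, `expected_protein_length`, and `gc_percent` are hypothetical helpers introduced for illustration.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 4514835
END_BP = 4515860
GENE_LENGTH_BP = 1026
PROTEIN_LENGTH_AA = 341


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus the terminal stop codon.

    Assumes a complete, unspliced CDS whose last codon is a stop.
    """
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide string.

    Whitespace is ignored, so the 10-base blocks used in the
    record's sequence layout can be pasted in directly.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return 100.0 * sum(b in "GC" for b in bases) / len(bases)


if __name__ == "__main__":
    # Compare the derived values with the ones listed in the record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Passing the full gene sequence from the Sequence section to `gc_percent` gives a value that can be compared with the listed GC content.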


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.109016
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 36
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAACAAG CTACATTAAT TTCACCGGGG AATATTGTTT TTTCTGAAGT GGGTACACCA 
ACTGAATTAG GGGATAATGA TGTTTTAATA AACATAAAAA AAATAGGTAT TTGCGGCTCT
GATATCCATG CTTATAAAGG GAAGCACCCT TTTACACCTT TTCCTGTAAT CCAGGGGCAT
GAATACAGCG GAGAGGTAGT TGCGATAGGG GCAGAGGTGC GATCTGTTAA GATTGGTGAT
AAGGTGACAG GCAGGCCTCA GCTGACTTGC GGTACATGCG GGCCTTGTTG TGCCGGATTG
TACAACGTAT GCGCTAATCT GAAAGTTGAA GGTTTTCAGG CACCGGGTAC GGCGAGGGAC
TATTTTGTGC TGCCCGAAGA CAGGACTTAT GTTGCGCCTG ATCAGGTTAG TTATGATGAA
ATTGCTTTGC TGGAGCCTGC AGCTGTAGCG GCGCATGCCA CGGCCATGAT CAGGGATATT
GAAAACAAAA ATATTGTGGT AACTGGTGCA GGTCCGATTG GTAACCTGAT TGCGCAGTTT
GCAAAAATCC GTGGTGCCAA ACGCGTGATT GTTACAGATT TCAATGATTT TCGTTTAAAC
CAGTTAAAGC AAACCGGTAT CGGCGATCTG ATAAATCTGA GTGTTGAAAC ATTTGAAGAA
GGAATTGACC GCATACTAAA AGGAGAAAGC TTCCAGGTAG GAATAGAAGC CGTAGGTGTG
GAGCCGGCAC TATATAACCT GATCAACAAT ATAGAAAAAG CTGGACAGGT ACTTATTGTT
GGTGTTTACG AAGAATTTCC GAGGTTAAAC ATGGGCTTTG TTGGTGAGCA TGAACTTTCT
ATCCAGGGTT CTATGATGTA TAAGCACGAA GATTATCTGC AGGCGCTGAA TTTTGTGGTG
TCGGGACAAC TCGTGTTGAA AACCCTGATT ACGCACCGGT TTGATTTTCA GGATTATAAT
GAAGCTTATG CATTTATTGA GAAGAATGCA TCACAAACTA TAAAAGTATT AATTGACGTA
AATTAG
 
Protein sequence
MKQATLISPG NIVFSEVGTP TELGDNDVLI NIKKIGICGS DIHAYKGKHP FTPFPVIQGH 
EYSGEVVAIG AEVRSVKIGD KVTGRPQLTC GTCGPCCAGL YNVCANLKVE GFQAPGTARD
YFVLPEDRTY VAPDQVSYDE IALLEPAAVA AHATAMIRDI ENKNIVVTGA GPIGNLIAQF
AKIRGAKRVI VTDFNDFRLN QLKQTGIGDL INLSVETFEE GIDRILKGES FQVGIEAVGV
EPALYNLINN IEKAGQVLIV GVYEEFPRLN MGFVGEHELS IQGSMMYKHE DYLQALNFVV
SGQLVLKTLI THRFDFQDYN EAYAFIEKNA SQTIKVLIDV N