Gene Phep_2077 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_2077 
Symbol 
ID8253181 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp2393759 
End bp2394760 
Gene Length1002 bp 
Protein Length333 aa 
Translation table11 
GC content45% 
IMG OID644935725 
Productaldo/keto reductase 
Protein accessionYP_003092344 
Protein GI255531972 
COG category[C] Energy production and conversion 
COG ID[COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.97994 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.282857 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCTATA CACCTGATCC AGCCAGATAT ACGCAAATGA AATATCGCCG CTGTGGCAAT 
AGCGGATTAA AATTATCTGC CATTTCCCTG GGCTTATGGC ATAATTTCGG ACATGTTGAT
CAGCTTGAAA ACTGTAGCAA CATCCTGAAA CTTGCTTTTG ACAGTGGCAT TACCCATTTT
GATCTGGCTA ACAATTACGG GCCACCCCCG GGTGCTGCCG AAGAAAATTT TGGCCTGCTC
TTAAAAAGGG ATTTTCCAGG TTACAGAGAT GAAATGATCA TTTCAACAAA GGCGGGCTAT
ACCATGTGGG ATGGACCTTA TGGCGACTGG GGATCTAAGA AATACCTCGT TTCCAGTTTA
GACCAGAGCC TGAAAAGACT TCAACTGGAT TATGTGGATA TCTTTTACCA CCATCGACCG
GACCCCGAAA CCCCGCTGGA AGAGACGATG TCTGCCCTGG ACCTGATCGT CCGCCAGGGT
AAAGCACTTT ACATCGGAAT TTCTAATTAT AAACCTGCAG AAGCTGCAAC TGCAATACAG
CTTTTAAAGG AGCTGGGTAC ACCCTGTATT ATTCATCAAC CTAAATACTC CATGTACGAA
CGCTGGATAG AGGGGGGATT GCTGGAGCTG CTCGGAAATC AGGGAGTAGG TTGCATACCT
TTTTCTCCAC TTGCGCAGGG ATTGTTAACG GATAAATATC TTAAAGGGAT ACCTGCTGAT
TCAAGGGCAG CAAAAACATC TGGTGCATTA CAGCCAGATC AGATTACAGC AGAACGGCTT
CGGCAGTTAA ACCAGCTGAA TGAGCTGGCA CAGTCACGGG GACAAAAACT TGCGCAAATG
GCCTTATCAT GGATATTACG TGATGAGCGT GTGACTTCAG TATTGGTAGG GGCAAGCAAA
CCTGAACAAC TTGCTGATTC TTTAAAATGT CTGGACAATA CCACCTTCAG TACAGCCGAA
TTACATCAAA TTGATTTGAT ACTTTCCGGT TCATCAACCT GA
 
Protein sequence
MSYTPDPARY TQMKYRRCGN SGLKLSAISL GLWHNFGHVD QLENCSNILK LAFDSGITHF 
DLANNYGPPP GAAEENFGLL LKRDFPGYRD EMIISTKAGY TMWDGPYGDW GSKKYLVSSL
DQSLKRLQLD YVDIFYHHRP DPETPLEETM SALDLIVRQG KALYIGISNY KPAEAATAIQ
LLKELGTPCI IHQPKYSMYE RWIEGGLLEL LGNQGVGCIP FSPLAQGLLT DKYLKGIPAD
SRAAKTSGAL QPDQITAERL RQLNQLNELA QSRGQKLAQM ALSWILRDER VTSVLVGASK
PEQLADSLKC LDNTTFSTAE LHQIDLILSG SST