Gene Phep_2238 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_2238 
Symbol 
ID8253344 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp2590649 
End bp2591860 
Gene Length1212 bp 
Protein Length403 aa 
Translation table11 
GC content45% 
IMG OID644935887 
Productglycosyl hydrolase family 88 
Protein accessionYP_003092504 
Protein GI255532132 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1331] Highly conserved protein containing a thioredoxin domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0386337 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value0.825384 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAGAA ACCTGATTTT AAAAAGCCTG TTTGTACTGA CCACAGTTTC CTTTGCGATA 
GGGGGATGTG GTGTCCGTAA AGCGAACCAA AACAGCAATG ACCTTGCGCA AATTGTTGAT
AAAGACATTA AAGCTGCAGA AGAACAGTAT AAATACTTTA TGAAGCAGAT CCCTGCAGAT
AAACTGCCAC GTAGCCTGGA TAAAAACGGG AAACTGGTGA CCAGCAATTC TGAATGGTGG
TGCAGCGGAT TTTATCCGGG TACGCTCTTG TACCTGTATG AACTGGGCAA AGACCCGGTA
CTGTATACCG AGGCGCTAAA CCGGTTAAAG CTTTTGGAAA AAGAACAGTT TAACAAAAGC
ACGCATGACC TGGGCTTTAT GATGTACTGC AGTTTTGGAA ATGCCAACCG TTTGAAACCA
TCAGCGGCCT ATAAGCAGAT CCTGATTAAC AGTGCCCGTT CCCTGGCCAG TCGTTTTAAC
CCCAAAGTGG GCTGTATCCG TTCCTGGAAC TCTAAAGACC CATCCGAATT TAAGGTGATC
ATTGACAATA TGATGAACCT GGAACTGTTG TTCTGGGCAG CTAAAGAAAC AGGTGACAAA
TCTTTTTACG ACATAGCAGT TACGCATGCC AATACTACGA TGAAAAATCA TTTCAGGCCC
GACTTCAGTT CCTATCACCT GGTGATATAC GACAGCAATA CCGGTGCCGT ACGTAAAAAA
CAAACGGTGC AGGGCTATGC AGATGATTCT GCCTGGGCAA GGGGACAGGG CTGGGGCCTA
TATGGTTATA CAGTGATGTA CCGGGAAACT AAAGACACCA GGTACCTGGA ACTGGCCAAA
AAGATAGCTG GCTTTATACT TGACAACCCT AAGCTGCCTG CAGATAAAAT TCCATACTGG
GATTTCAATG CACCGAATAT TCCGGATGCT TCGAGAGATG CCTCTGCCGG TTCGCTGATT
GCATCGGCTT TGCTGGAACT GGCCGGCTAT ACTGATAAAG CACTGGCTGA TCAATACGTT
TCGGCCGCTG AACTCATGAT CCGTTCACTG TCAAAACCTC CTTATCAGTC TTTATACGGA
GAAAACAGCG GTTTTCTGCT AACCAAAAGT GTAGGTCACC TGCCTGGGAA ATCTGAAGTG
GATGTGCCGC TTACCTATGC AGATTATTAT TATGCGGAGG CCTTACTGCG TTATAAAAAA
CTACAGAAAT AA
 
Protein sequence
MKRNLILKSL FVLTTVSFAI GGCGVRKANQ NSNDLAQIVD KDIKAAEEQY KYFMKQIPAD 
KLPRSLDKNG KLVTSNSEWW CSGFYPGTLL YLYELGKDPV LYTEALNRLK LLEKEQFNKS
THDLGFMMYC SFGNANRLKP SAAYKQILIN SARSLASRFN PKVGCIRSWN SKDPSEFKVI
IDNMMNLELL FWAAKETGDK SFYDIAVTHA NTTMKNHFRP DFSSYHLVIY DSNTGAVRKK
QTVQGYADDS AWARGQGWGL YGYTVMYRET KDTRYLELAK KIAGFILDNP KLPADKIPYW
DFNAPNIPDA SRDASAGSLI ASALLELAGY TDKALADQYV SAAELMIRSL SKPPYQSLYG
ENSGFLLTKS VGHLPGKSEV DVPLTYADYY YAEALLRYKK LQK