Gene Phep_3403 details


Gene Information

Locus tag: Phep_3403
Symbol:
ID: 8254522
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pedobacter heparinus DSM 2366
Kingdom: Bacteria
Replicon accession: NC_013061
Strand:
Start bp: 4050670
End bp: 4051743
Gene Length: 1074 bp
Protein Length: 357 aa
Translation table: 11
GC content: 47%
IMG OID: 644937055
Product: glycoside hydrolase family 43
Protein accession: YP_003093659
Protein GI: 255533287
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3507] Beta-xylosidase
TIGRFAM ID:
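
The start and end coordinates and the gene and protein lengths listed above can be cross-checked with simple arithmetic. The sketch below is only an illustration: the function names are hypothetical, the coordinates are assumed to be 1-based and inclusive, and the gene length is assumed to include a terminal stop codon that is not counted in the protein length.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length(4050670, 4051743)
print(length, protein_length(length))  # prints "1074 357"
```

Under those assumptions the computed values agree with the listed gene length of 1074 bp and protein length of 357 aa.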


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000194853
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 43
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAACCA TACCGTTTTT CCTGGCTTTG ATCTGGCTTT TAATGGTAAC CTGCGGCAAA 
AAAAATGTGC CGCAGGAACC GCAACCAACT GGAACTGGTC CCAGCGCCAA CCCAGTACAA
TACAGAAACC CTGTATTTGA ACCAATACTT GCAGATCCAA GCATCATAAA AGGTACAGAT
GGCTGGTATT ATGGTTATGG AACAGAAGAT GATTGGGGCG ATGGAAAGGG AAACAGGCTC
GTACCGGTTG TACGTTCACT CGATCTGGTG AGATGGTCCC TGGTTGGAAA TGCTTTCACG
GCAAAACCTG TATGGAAAGC CGAAGGCGGT TTATGGGCAG TAGATGTGGT GCGGGTAAAC
AACCTGTACC ATATGTACTA TTCTTTCTCG CTTTGGGCAG ACCGCAATCC TGCCATTGGG
CTGGCTGTGG CCAATTCACC TGCAGGGCCA TTTGTAGACA AAGGAAAACT TTTTTTTAGT
GAGGAGATAG GGGTGCCTGT CTCGATTGAT CCGCATTACT ACGAAGAAAA TGGCAAAAAG
TATCTATTTT TCGGTAGTTA TAGCTCACAA ACCGTTCAGG GAACGTATGC AGTAGAGCTA
ACTGCAGATG GCTACGCCGT AAAAGACATC AATCAAAAAA CAAATATTGC TGCCGGGGAT
TTTGAAGCCG TAATGATTCA CAAAAGAGGC AACTACTATT ATTTTTTTGG CTCAAAAGGC
AGCTGCTGCG ATGGGGCTGC CAGCACTTAT AACGTCAGGG TCGCACGATC AGAAAATCTG
CTTGGCCCTT ACTTGGATCA AGCAGGTAAA AACATTGTGC AGCGAGGAAA TGGTACGCTG
ATCTTGCAGG GCAATTCCAA GTTTGCAGGT CCGGGACACA ATGCTAAAAT CATTACCGAC
AAAAACGGGA CGGACTGGCT CTTCTACCAT GCTATAGATA AAATGAACCC TAAGGTATCC
AGCGGTGCAA GCAGAAGAAT GCTGATGCTC GATAAGCTCA GTTGGGTAGA GGGATGGCCT
GTAATCGCCA ATGGGGTTCC GAGCGGCGAC ATCCAGGCTG GCCCTGAATT ATAA
 
Protein sequence
MKTIPFFLAL IWLLMVTCGK KNVPQEPQPT GTGPSANPVQ YRNPVFEPIL ADPSIIKGTD 
GWYYGYGTED DWGDGKGNRL VPVVRSLDLV RWSLVGNAFT AKPVWKAEGG LWAVDVVRVN
NLYHMYYSFS LWADRNPAIG LAVANSPAGP FVDKGKLFFS EEIGVPVSID PHYYEENGKK
YLFFGSYSSQ TVQGTYAVEL TADGYAVKDI NQKTNIAAGD FEAVMIHKRG NYYYFFGSKG
SCCDGAASTY NVRVARSENL LGPYLDQAGK NIVQRGNGTL ILQGNSKFAG PGHNAKIITD
KNGTDWLFYH AIDKMNPKVS SGASRRMLML DKLSWVEGWP VIANGVPSGD IQAGPEL
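
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any length or composition check. The following is a minimal sketch of that cleanup, plus a GC percentage calculation of the kind that could be compared against the GC content field; the helper names `clean_sequence` and `gc_percent` are hypothetical and not part of any database tooling.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Example using only the first two blocks of the gene sequence.
fragment = clean_sequence("ATGAAAACCA TACCGTTTTT")
print(len(fragment), gc_percent(fragment))
```

Pasting the full gene sequence into `clean_sequence` gives a string whose length and GC percentage can be compared with the Gene Length and GC content fields.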