Gene Phep_3866 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_3866 
Symbol 
ID8255000 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp4640605 
End bp4641867 
Gene Length1263 bp 
Protein Length420 aa 
Translation table11 
GC content45% 
IMG OID644937530 
Productglycosyl hydrolase family 88 
Protein accessionYP_003094119 
Protein GI255533747 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0129916 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGACGAT TTAAAAACCT TCCGCTCTTA TTTTTCTTTC TCCTGATAGG GGTAAATTAC 
AGTCATGGGC AGATGAAGCG CCTTAAGCCT GAAAAAGAGA TGTTAAAGCT GGCAGACCAG
GTTCTGGCGC AATCTGTGGC CCAATATAAA ACCATGATGC GGCAATTACC GCAGGGGCAG
TTGCCACGCA CTTTCGAAAA CGGGAAGCTG GCTACGGCAA GCCCTTATTC CTGGATCAGT
GGCTTTTATC CGGGTTCACT GATGTACCTC TACGAATATT CGCGCGACAG CGTTTTGTTA
AAAGATGCAG AAATAAGGCT AAAGGACCTG GAAAAGATAC AATATGTAAC AGCCAACCAT
GATCTTGGCT TTATGATGTA CTGTAGTTTT GGAAATGCTT ACAGATTGCT CGACAGCACC
CGTTACAAAG ATATACTCGT CCAATCTGCC AAATCCCTCG TCTCCAGGTT TTATCCAAAA
ACGGGCTGTA TCAAATCATG GAACAAGATC AAGTCCCTGG ATGGGAAAAG AATGCTGAAT
TTCCCGGTGA TCATTGATAA CATGATGAAC CTTGAGCTTT TGTTTTTTGC TTCAAAAGTT
ACCGGCGACC CCTCTTATAA AGATATTGCC ATTAAACATA CAGAAACGAC GCTTAAAAAC
CATTTCAGAC CTGATTACAG CAGTTATCAT GTAGTAGACT ATGATGAGGA AACAGGTGCG
GTTAAAAGTA AAGAGACTAT GCAGGGATTT TCTGACAATT CAACCTGGGC AAGAGGACAG
GCCTGGGCCA TATATGGATT TACCATGGTA TACCGGGAAA CGAGGGTGAA AAAATACCTT
GAAGCAGCAC AAAAGATGGC TGCGCATTTC ATCAACCATC CCAACCTGCC CAAAGACAAG
ATTCCTTACT GGGATTTTAA TGTCAACCAG CCTGGGTTTA TCCCGCCATG GAAATATGAT
CCGGCTAAAT ATCGGGAAAT ACCCAGGGAT GCTTCTGCTG CAGCTATCGT TTCTTCGGCG
CTGATAGAAC TGGCCGCTTA TGTGGATGCT GCAACAGCAC AAAAATATTT GACGATTGCC
GAAACCATGC TGAAGTCCCT GTCTTCTGCA CAATACCGTT CTGCTGCCGG AGCCAATGGC
GGTTTTATTC TGATGCATTC TTCAGGAGGT GTACCAGGAA ATATAGAAGT GGATGTCCCG
GTTTCCTATG CGGATTATTA TTATCTGGAA GCCTTGATGC GATATCGTGC AATGGGCGCC
TGA
 
Protein sequence
MGRFKNLPLL FFFLLIGVNY SHGQMKRLKP EKEMLKLADQ VLAQSVAQYK TMMRQLPQGQ 
LPRTFENGKL ATASPYSWIS GFYPGSLMYL YEYSRDSVLL KDAEIRLKDL EKIQYVTANH
DLGFMMYCSF GNAYRLLDST RYKDILVQSA KSLVSRFYPK TGCIKSWNKI KSLDGKRMLN
FPVIIDNMMN LELLFFASKV TGDPSYKDIA IKHTETTLKN HFRPDYSSYH VVDYDEETGA
VKSKETMQGF SDNSTWARGQ AWAIYGFTMV YRETRVKKYL EAAQKMAAHF INHPNLPKDK
IPYWDFNVNQ PGFIPPWKYD PAKYREIPRD ASAAAIVSSA LIELAAYVDA ATAQKYLTIA
ETMLKSLSSA QYRSAAGANG GFILMHSSGG VPGNIEVDVP VSYADYYYLE ALMRYRAMGA