Gene Phep_3872 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_3872 
Symbol 
ID8255006 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp4651245 
End bp4652861 
Gene Length1617 bp 
Protein Length538 aa 
Translation table11 
GC content45% 
IMG OID644937536 
Producthypothetical protein 
Protein accessionYP_003094125 
Protein GI255533753 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG5520] O-Glycosyl hydrolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.875392 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones53 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAATATA TGCGTGCTTC TTTGCCATCC TTTGTGCTCA TCATTTTGAT GTCCGTATTT 
AATCCTTTAG TCTTTGCCCA GAAGAAAATA ATAAAAAATC CTGGTCCTGG TGCTCCTCAA
AGCATCAGGA TCAGTTTAAC TGAAGAAAAG CAGACCATTC ACAGTTTTGG GGCCTCCGAT
TGCTGGTCGG CAAAATATGT GGGCAAGTGG TCCGATCTGC AGAAAAAGAA CCGTATTGCT
GATCTTTTAT TCAGTACAGA TACGACTTCA GATGGCAAGC CGAAAGGTAT AGGCCTGTCT
TTATGGCGGT TTAACATTGG ATCGGGTAGT TACGAACAAG GTGCGGAAAG TAATATTCCT
GATGAATGGC GCAGGGAAGA ATGCTTTTTA AATGAAAACG GTACTTATAA CTGGGAAAAA
CAGAACGGGC AGCAATGGTT TCTGAAAGCG GCTAAACAAC GGGGGGTAGC TTATACTCTG
GGTTTCTCGC TTACTCCTCC TGCTTTTATG AGCGGGAATG GGAAGGCCTA TAACAACAGC
AATACACCGA ACCTGAATAT AAAGGCCGGT ATGACAAATG CTTATGCAGC ATTCATGTCT
GCGGTTTCTG CTCATTTTAA ATTTGACTTT TTAAGCCCCG TAAATGAACC GCAGTGGTTC
TGGGGAAGGG ACAGGATCAG TCAGGAAGGT TCACAGGCAA CGAATGCAGA AATAGCAGAC
CTGGTAAAGG TGCTTTCTGT TCAACTTCCT TCAAAGAGTC CAGGAACGCA GGTGGTTATT
GGGGAAGCCG GGCAATGGGA TTTTCTTTAT GGAAAGAATA CAGATGGCCG GGGTGATCAG
ATCAGTGAGT TCTTTTCGCC CGCTTCTGCT CATTATATTG CTGATCGTCC AAATGTTAAG
CGTATTATTT CCGGACACAG TTATTTTACA ACATGTCCGG ATAACAACCT GATCCATGTG
CGTGAGCAGG TTGCTGCAAA GGCCAAACAA ACAGACCCCT TGCTGGAAGT ATGGCAAACG
GAATTCGGCA TACTTGGCAA CATCTGCAAC CAATACAATG GCGGCCCGCG AAATACCGGG
ATCGATTATG GGCTGTATGT AGCCAAGGTG ATCCATCACG ACCTTACCAT TGCCAATGTT
ACTTCCTGGC AATGGTGGCT GTCCATCAGT CCTTATAATT ACAGTGATGC CCTCGTGTAT
ATCAATGACC CTTCGGGGAT GATCAATGCC GCAGGATGTA AAAATGATGG TGAGGTATTA
GAGTCCAAAC AACTATGGGC GATGGGAAAT TATTCCCGTT TTATCCGTCC GGGAATGAAG
CGTATCGGGG TAAAGACAGA TGGTATTGCC GGTCCGGTTG AAGGGGCAGC TTCTCTCATG
GTTTCTGCTT ATAAAGATGA AACCGCAAAA AAAATAGTAG TTGTAATTGT TAATCCGGAG
CAAAAGGAAA AAGAATTTCA GCTTAGGGCC GCAGGTACTG TCTTTAAACT GGCAGGGAAC
CTGCTGAACG TATACACTAC TGATGCACAA AATAACCTTA AAAAAACAAC TGCTTCGGCA
GAGAAAGTAA AGATCGTTCC CCGGTCGGTA GTTACACTTG TTGGCACTTA CCATTGA
 
Protein sequence
MEYMRASLPS FVLIILMSVF NPLVFAQKKI IKNPGPGAPQ SIRISLTEEK QTIHSFGASD 
CWSAKYVGKW SDLQKKNRIA DLLFSTDTTS DGKPKGIGLS LWRFNIGSGS YEQGAESNIP
DEWRREECFL NENGTYNWEK QNGQQWFLKA AKQRGVAYTL GFSLTPPAFM SGNGKAYNNS
NTPNLNIKAG MTNAYAAFMS AVSAHFKFDF LSPVNEPQWF WGRDRISQEG SQATNAEIAD
LVKVLSVQLP SKSPGTQVVI GEAGQWDFLY GKNTDGRGDQ ISEFFSPASA HYIADRPNVK
RIISGHSYFT TCPDNNLIHV REQVAAKAKQ TDPLLEVWQT EFGILGNICN QYNGGPRNTG
IDYGLYVAKV IHHDLTIANV TSWQWWLSIS PYNYSDALVY INDPSGMINA AGCKNDGEVL
ESKQLWAMGN YSRFIRPGMK RIGVKTDGIA GPVEGAASLM VSAYKDETAK KIVVVIVNPE
QKEKEFQLRA AGTVFKLAGN LLNVYTTDAQ NNLKKTTASA EKVKIVPRSV VTLVGTYH