Gene Phep_3113 details


Gene Information

Locus tag: Phep_3113
Symbol:
ID: 8254231
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pedobacter heparinus DSM 2366
Kingdom: Bacteria
Replicon accession: NC_013061
Strand:
Start bp: 3721279
End bp: 3722853
Gene Length: 1575 bp
Protein Length: 524 aa
Translation table: 11
GC content: 45%
IMG OID: 644936767
Product: glycoside hydrolase family 28
Protein accession: YP_003093372
Protein GI: 255533000
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG5434] Endopolygalacturonase
TIGRFAM ID:
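
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: the variable names are hypothetical, and it assumes the start and end coordinates are inclusive and that the final codon is a stop codon that is not counted in the protein length.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3721279
end_bp = 3722853

# Assumption: coordinates are inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and contributes no residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1575 524
```

Both results agree with the Gene Length (1575 bp) and Protein Length (524 aa) listed in the record.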


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00001805
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTTAAAC TATTCTTAAT TATGCGGAAT TATTTATTCA TCCTACTGAC CAGCATTTCA 
ATCCTTACCC GGGCGGCCGA TGTAGAGGTC ACTACTTATG GAGCCAAAGG CGATGGACTT
ACTGTAAATA CCACTGCCAT TCAGAAAAGC ATCGATGCCT GTGCAGCATC GGGTGGGGGT
AAGGTAATCT TCCCTGCTGG TCATTTCATG AGTGCAACTG TAGTGCTAAA AAGTAATGTA
ACCCTTTACC TTTCCGACGG ATGTACTTTA ACCGGTGTTA AAGGTGCTGC CAATTACCCT
TACCAGCAGG TCAGCATTCC ATTTTATGGC GAAAACTGGG CAAAGCAGGC ACTGATCTTT
GCGCATAAAG CCGAAAATAT AGGTATTGAA GGTCCCGGAA CAATAGACGG ACAGGGGGCA
AGTTTTGAGA TCAATACCAT TAAAAAGCCA GACCGCTATA TGAACAGACC TTACCTGATC
TGGTTTGTAG CCTGTAAGAA AGTTAGCGTT AAACAGGTAC ACCTGCGCAA TTCTGCTTTC
TGGATGCAAC ATTACCTGGG TTGTGAAGAT GTAGTGATCG ATGGTATTTC TATTTGGAAC
CATTCCAATA AAAACAACGA TATGATCGAT ATTGATGGCT GTAAAAATGT AAGGATCACC
AATGTAAACG GCGATTCGGA CGATGATGGC CTGACTTTAA AAAGTACTTC CAAACTCATT
TCAGAGAATA TAGTGATCAG CAACTGCGTG TTTAGCAGTC ACTGTAACGG ACTCAAGTTC
GGTACCGAAT CTACCGGTGG TTTCAGAAAC GTTACCATCA GCAATATCGT CATCAAGCCT
TCGGCACAGC TGACCACCAT TTATGGTAAA CCGGCCGGCA ACAGTGGCAT TTCACTGGAA
ATGGTTGATG GGGGCATTAT GGAAAATATC AGTATCAGTA ACGTAGTCAT AGACGGGCCG
CAGGTACCGC TGTTTATCCG CCTGGGCAAC AGGGCAAGAA AACATACCGA TAATGCCGCT
GATCCGGGAA TTGGGGTAGC CAGGAACATC CATATCTCCA ATGTAACGGC TACAGGGGCC
GATATTACCG GCAGCTCAGT CATCGGACTT GAAAATGCCA TCATAGAAAA TGTTTCTTTA
AGCAACATCT CTATTTCCAG CTCTGGCGGG GCAAAGGCCG ATGCCATGTT CCGTAAACTG
GAAACCCTGG CCGATCATTA TCCAGAAGCC ACTATGTTTG GTCCACTGCC GGCTTACGGA
TTGTACGTAA AACATGTAAA AGGAATCCGC TTAAATGACA TCAGGCTTTC TTTTACCGGT
GATGACGAAC GTCCGGCCAT AGCCCTTGAA AATACAAGCG ATTTTGAGCT TAGGGGGCTC
AATATGGCCA GTACCGCCAA TACAAATACT GCAGTTTATC TGAAAAATGT AGCGAACGGC
TTTGTTACCG CAAATACTGT ATATACACCT GCAAAGTATT TTATCTGGAA AGAAGGCGCT
GTAGGCAATG TCTCGGTAAG CAATAACCAG CACCCGAAAA TTAAATCAGT AAGCAACCAA
ACTACAAAAA AATGA
 
Protein sequence
MLKLFLIMRN YLFILLTSIS ILTRAADVEV TTYGAKGDGL TVNTTAIQKS IDACAASGGG 
KVIFPAGHFM SATVVLKSNV TLYLSDGCTL TGVKGAANYP YQQVSIPFYG ENWAKQALIF
AHKAENIGIE GPGTIDGQGA SFEINTIKKP DRYMNRPYLI WFVACKKVSV KQVHLRNSAF
WMQHYLGCED VVIDGISIWN HSNKNNDMID IDGCKNVRIT NVNGDSDDDG LTLKSTSKLI
SENIVISNCV FSSHCNGLKF GTESTGGFRN VTISNIVIKP SAQLTTIYGK PAGNSGISLE
MVDGGIMENI SISNVVIDGP QVPLFIRLGN RARKHTDNAA DPGIGVARNI HISNVTATGA
DITGSSVIGL ENAIIENVSL SNISISSSGG AKADAMFRKL ETLADHYPEA TMFGPLPAYG
LYVKHVKGIR LNDIRLSFTG DDERPAIALE NTSDFELRGL NMASTANTNT AVYLKNVANG
FVTANTVYTP AKYFIWKEGA VGNVSVSNNQ HPKIKSVSNQ TTKK
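
The protein sequence corresponds to the gene sequence read codon by codon. The following is a minimal sketch of that translation, not the database's own pipeline. The names `CODON_TABLE` and `translate` are hypothetical. It assumes the codon-to-amino-acid assignments of translation table 11 match those of the standard code (the tables differ in which start codons are permitted, which does not matter here because this gene begins with ATG), and it stops at the first stop codon.

```python
from itertools import product

# Codon table in TCAG order; assumed shared by the standard code and table 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 15 nucleotides of the gene sequence above.
print(translate("ATGCTTAAAC TATTCTTAAT TATGCGGAAT"))  # MLKLFLIMRN
print(translate("ACTACAAAAA AATGA"))                  # TTKK
```

The two outputs match the first ten and last four residues of the listed protein sequence.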