Gene Phep_2749 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_2749 
Symbol 
ID8253857 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp3247888 
End bp3251307 
Gene Length3420 bp 
Protein Length1139 aa 
Translation table11 
GC content47% 
IMG OID644936397 
Productglycoside hydrolase family 2 sugar binding 
Protein accessionYP_003093012 
Protein GI255532640 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0806564 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATTTGA ACTTAAGTAA AACCATAGCA TTCGTTTTAT TGACTTGCAG CCTGCAGCCT 
GGCGCAGCTA TGGCGCAGGC AAATGAGTCT GCTAAGCCCT GGGTATTATG GCACTGGATT
AAAGGCGGGG TTTCAAAACC AGGTATTACC GCCGATCTGG AAGCCATGAA GTCAGCGGGC
ATCGGAGGTG CATATCTTTT ATCGATTAAA GATGTACCCA ATCCCCCTTT ATTTAACCCA
TCTGTACGGC AGCTTACCCC ACAATGGTGG GACATGGTAA CCTTTGCCAT GCAGGAGGCC
AGGCGTTTAA ACCTGAAACT GGGGATGCAT GTAAGTGATG GCTTTGCACT TGCAGGAGGG
CCCTGGATTA CACCTGAACT TTCTATGCAG AAAGTGGTAT CTACCCAATT GAATATCAAG
GGTGAAACAG CTGAAAAAAT AAAACTGCAG CAACCGGAAA CCCTGGAAGG TTATTATAAA
GACATTGCGG TATACGCCTA TCCCTCAGCT GAAGGTGCCG GTATTTCCAC ACAGACTGTG
ATTCCCGAAA TTACGACCAG TAATGCTGCA GATGCAAGCG GACTGATCAG ACCGGGTAAT
ACGAAAAATT TTGGAAGTAA TGAAGCCTGC TGGATTCAGT ATGCTTTTAA ACAGCCTTTT
ACCACCCGCA CCATCCGCAT AAGTACTGGC AGCAATAACT ACCAGGCACA ACGGCTGGAA
ATACAGGTTA GTGATGATGG TGAAAATTTC CGTTCGGTGG GGCGCCTGGA GCCGCCAAGG
CATGGCTGGC AGGACACTGA TGCCGATGTG ACACATAGTA TTGTACCCGT TACCGCAAGA
TATTACCGCT TCGTATACGA TAAAAAAGGA TCGGAGCCCG GTTCAGAAGA TCTGGACGCC
GCCAAATGGA AGCCTTCTTT AAAACTGAGG AACCTGGAGC TTTCTGCGGA AGCCAGGATC
AACCAGTTTG AGGGAAAAGC TGGCTTGGTA TGGCGCATCA GCAAAGCCAG CACAAAAGAG
ACGCTGGAAG ACCATTTATG TGTACCGATA GATAAGATCA TCAACCTGAG CGATAAAATT
AAAGCAGACG GCAGCTTAGA CTGGAAGGCG CCTAAAGGAA ACTGGACCAT ATTGAGGATT
GGCCATACTT CCACCGGTCA TAAGAATGCC ACTGCCGGAG CAGGTATGGG GCTGGAATGT
GATAAATTTA ATCCGGCTGC AGTAAAATTA CAATTTGACA GCTGGTACGG GGCAGCCTTA
AAACATGGAG GACCTGAAAT CGCCTCAAAG GTTTTGAATG AATTGTTTGT TGACAGCTGG
GAGTGTGGCA GCCAGAACTG GTCGCCACTT TTTGCAGCGG AATTTAAGAA ACGCAGGGGC
TACGACCTGA TGCGTTATCT GCCGGTTATG GTAGGCATAC CGCTTGGAAG TGTTGAGCTG
TCTGAAGCCT TTTTACATGA TGTGCGAAAA ACCATTGCTG AACTGGTTGC AGACCAGTTT
TACTACACCT TATCCAGCCT GACCAAAGAG AAAGGTGTCA CTTTTGCCGC TGAGAATGTT
GCACCGACCA TGCTGAGCGA TGGCCTGCTG CATTATAAAA ATGTAGACAT GCCGATGGGT
GAGTTCTGGT TGAACAGTCC GACACACGAT AAACTGAACG ACATGCTGGA TGCGGTTTCC
GGCGCGCACA TTTATGGCAA AAATATCATA CAGGCAGAAG CTTTCACCAC CATTAGGATG
GATTGGAATG AACATCCCGG CAATATGAAG ACCTTACAGG ATAGAAATTA TGCGCTGGGC
ATCAATAAAC TGGTATATCA TGTTTTTGCC CATAATCCAT GGCTGGACCA AAAGCCAGGG
ATGACACTGG ATGGGATAGG TTTATATTTT CAGCGTGACC AGACCTGGTG GAAACTTGGA
AAGGCCTGGG TAGATTATGC CACACGTTGT CAGGCACTTT TACAACAGGG TAAGCCTGTA
GCCGATGTCG CGGTATTTAT TGGTGAAGAA TTGCCCAGCC GCTCGGTATT ACCTGATCGG
TTGGTTAACC TGCTGCCTGG CCTTTTTGGT GCTGAAGTGG TGGCAGCTGA GGCAAAAAGA
CTGGCCAATA CCGGCGAGCC CTTAAGACAA AAGCCTGCAG GGGTAACACA TTCGGCAAAC
ATGGCCGATC CGGAAGACTG GGTAAATCCA TTAAGAGGCT ATGCCTACGA TTCTTTTAAT
CCTGATGTAT TGCTTGACGC AAGTGTAAAA GACGGGAGAG TGGTGTTTGC CAGCGGTGCA
AGTTATGCGG TACTGGTTAT CCCGGGTAAA ATGCTCCTGA ACCCAAATTA TCAGTATATG
AGTAAGGAAG TGGCTGCAAA GCTGAACGAG CTGGCGAAAG CAGGTGCAAG CCTTGTGCTC
GGTGAACGTC CGAAATTCCA GCTGGGTATG CCTAAAACAG GCAGTGATCA GGACTTTGAG
ACTTTACTGA CCGAGCTTTG GGATGGTAAT TTTAAAACAG CAGGACAGGG GCAACAGCAG
CTTTCCCTGA AAACACTGGG TAAGGGAAGG ATCATTAAGG GTGCTTATCA GGCAGAAAGT
TTTGACCTTA TTGGCCTGGA AAGGGATCTG CTGGTAATGG AAGGAAATGG AGATTATGCG
AAAAAGGTAG CTTATACCCA TAGGATTGCA GCCGATAAAG AGTTTTATTT TGTTGCTAAC
CAGGAAAATA AAAAGCGCTT GCTGGAGTTT TCTTTTAGAA CAGCCGGTAA AATACCTGAA
CTTTATGATG CTGTAACTAA TGAAACTATT GCCCTGAAAG CCTGGAAAAC TGAGGCTGGT
CGTACAAGAA TGCAGCTAAA GCTGGCCCCA AATGCCTCCG CCTTTGTCAT CTTTAAAGAT
AGAGCGGCTA CTACTGCCTC TGCTACAGGC AAAAACTGGA AAGAACAGCA ATTGGTAAAC
ACGATAAATG GGAGCTGGAA AGTACGTTTT GACCCTGCTT ATGGCGGTCC GGAAAAGCCG
GTAACTTTTG CTGCGCTGAG CGATTGGAGT AAACATCCGG ACAGCCTGAT CAGATATTAC
TCAGGTTCAG CGGAGTATAG TAATAATTTT AAAGTTAAAA AACAGGCCGG GCAGCAATAC
TTGCTGGATT TGGGAACGGT AGGCTGTATC GCAGAGGTTC GGGTAAACGG AATTTCTTGT
GGTGTGGTCT GGACTGCGCC TTACCAGGCC GACATTACAG CTGCTCTAAA AAATGGGGAC
AATGTGCTGG ACATAACGGT AACCAATACC TGGGCCAACA GGATCATCGG CGATCAGCGT
TTGCCGGAAG ACAGGAGAAT CACAAAAACA AATGCGCCAT ACCGTTTGGA AGGCAAGCCA
CTGAATGAGG CTGGCTTGCT TGGTCCGGTA AGGCTATTTA AACAGGAACA ATCAAATTAA
 
Protein sequence
MNLNLSKTIA FVLLTCSLQP GAAMAQANES AKPWVLWHWI KGGVSKPGIT ADLEAMKSAG 
IGGAYLLSIK DVPNPPLFNP SVRQLTPQWW DMVTFAMQEA RRLNLKLGMH VSDGFALAGG
PWITPELSMQ KVVSTQLNIK GETAEKIKLQ QPETLEGYYK DIAVYAYPSA EGAGISTQTV
IPEITTSNAA DASGLIRPGN TKNFGSNEAC WIQYAFKQPF TTRTIRISTG SNNYQAQRLE
IQVSDDGENF RSVGRLEPPR HGWQDTDADV THSIVPVTAR YYRFVYDKKG SEPGSEDLDA
AKWKPSLKLR NLELSAEARI NQFEGKAGLV WRISKASTKE TLEDHLCVPI DKIINLSDKI
KADGSLDWKA PKGNWTILRI GHTSTGHKNA TAGAGMGLEC DKFNPAAVKL QFDSWYGAAL
KHGGPEIASK VLNELFVDSW ECGSQNWSPL FAAEFKKRRG YDLMRYLPVM VGIPLGSVEL
SEAFLHDVRK TIAELVADQF YYTLSSLTKE KGVTFAAENV APTMLSDGLL HYKNVDMPMG
EFWLNSPTHD KLNDMLDAVS GAHIYGKNII QAEAFTTIRM DWNEHPGNMK TLQDRNYALG
INKLVYHVFA HNPWLDQKPG MTLDGIGLYF QRDQTWWKLG KAWVDYATRC QALLQQGKPV
ADVAVFIGEE LPSRSVLPDR LVNLLPGLFG AEVVAAEAKR LANTGEPLRQ KPAGVTHSAN
MADPEDWVNP LRGYAYDSFN PDVLLDASVK DGRVVFASGA SYAVLVIPGK MLLNPNYQYM
SKEVAAKLNE LAKAGASLVL GERPKFQLGM PKTGSDQDFE TLLTELWDGN FKTAGQGQQQ
LSLKTLGKGR IIKGAYQAES FDLIGLERDL LVMEGNGDYA KKVAYTHRIA ADKEFYFVAN
QENKKRLLEF SFRTAGKIPE LYDAVTNETI ALKAWKTEAG RTRMQLKLAP NASAFVIFKD
RAATTASATG KNWKEQQLVN TINGSWKVRF DPAYGGPEKP VTFAALSDWS KHPDSLIRYY
SGSAEYSNNF KVKKQAGQQY LLDLGTVGCI AEVRVNGISC GVVWTAPYQA DITAALKNGD
NVLDITVTNT WANRIIGDQR LPEDRRITKT NAPYRLEGKP LNEAGLLGPV RLFKQEQSN