Gene Phep_3977 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_3977 
Symbol 
ID8255111 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp4794373 
End bp4796004 
Gene Length1632 bp 
Protein Length543 aa 
Translation table11 
GC content38% 
IMG OID644937641 
Productglycoside hydrolase family 28 
Protein accessionYP_003094230 
Protein GI255533858 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG5434] Endopolygalacturonase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.722531 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAATA GATTTAAACT GTTAATGATA ATATTTCTTG GATTCATTTA TAGTTGCAGG 
ACTGAGAAAA ACTTTAATAT ACTTGACTTT GGAGCCGTCC GGGACGGGAA GACCTTAAAC
ACCGCCGCTA TTCAAAAGGC AATAGATGAA TGTAGCAAAA AGGGCGGACG GGTTGTAATT
CCGAAAGGTG TTTACCTGTC AGGTACTTTA TACATGAAAA GCAATGTAGA ACTACATATT
GAGGAAGGGG CGATACTTAA AGGCAGCGCT TCTTTTAAAG ATTATCCTGA CAACAAAGTG
ACCTACAAAA ATGCCTTTAC ACATTTTGAA GACGGAAAAC TTTATGCTAA TAAAGCATTT
ATTTTTGCGG AAAATGTGAG TAACATCTCG TTTACTGGTA AGGGAACTAT TAACGGCAGT
GGCGACAGTC CCGAATTTAA TCTTGGAAAT GACGATACGT CTATAAGCCG TTCAAGACCC
TGCATGTTGC TAATTATTGA CAGCAAGCAC ATTAAGCTGA ATGATTTGAC TTTAGAGAAC
TCAGCATACT GGCTACAAAA CTATCTGGGC TGTGAATTTC TTGAGCTAAA AGGTTTAAAG
ATTTATAACC AGTCAAACTA TAATCAGGAT GGAATGGATA TTGACGCCAA ACACGTACTG
GTAGAAGGCT GTACCTTAGA TGTTGATGAT GATGGTATTT GTTTTAAAAG TCATGATCCG
AAACGAATTG TAGAAGATGT GGTGGTCCGA AACTGTAAGA TATCCAGTAA CTGCAATGCC
ATTAAATTTG GCACCAAATC TATGGCCGGA TTAAAAAATG TCAGCATATC AAATTGCAAT
ATACAGAAAG CCTCTGCTGA CCCCATCAGA CATTGGCAAA AGACATTAAA ATTTATAGAC
CAACCCATAA CGGTAATTTC AGGCATTGCA CTTGAAGCTG TAGACGGAGG TATAATTGAC
AGCATCAGCA TTTTTAATAT CACGATGAAA GACGTACAAA CACCCCTATT TATTGTATTG
GGTAATAGAG GTAATAAGCC AATGGGCGAT AAGAATTTCT ACAATACCTC TGCAGGCAAT
ACAGCACAAC AAGCTGTAGG AAAAATAAGT AATATTCAGC TTAAAAATAT CAAGGCGACA
AGTCATAGCA AAATGGCCAG TTCTATAACA GCATTTCCAG GCCATTACAT AGAAAATATA
ACACTAGACA ACATTGCATT TAACATTATG GGAGCAGGAA CCCAACAGGA AGCCATTACC
CCATTGATAG AGAACCCGGG TGCATATCCA GAAAACAGGA TGTACGGACT GGCCTATCCT
GCAAGCGGAT TTTTTATACG GCATGTAAAA AACCTATCTT TAAATCACAT CAAATTAAGT
GTCAGAAAAC CCGATTATCG TTCATCTATA ATTTTGGATG ACGTATTGGG AGTTAACATA
AGCAATGTAA ATTTACCGGT GCCTGAAGGA AACACCGCTG CTATTGGTTT AAAAAACAGT
AAGAATATAA AAGTCATCAA TCCTGTTTTT AAATCTGAAA ACCAACCATT GATACAACTA
GATGGCACAG CTGAACCTGA AATTGCCATT GCCGGGTTTA AAAAATATAA AGGATGGCTA
ACGTCTTTAT AA
 
Protein sequence
MKNRFKLLMI IFLGFIYSCR TEKNFNILDF GAVRDGKTLN TAAIQKAIDE CSKKGGRVVI 
PKGVYLSGTL YMKSNVELHI EEGAILKGSA SFKDYPDNKV TYKNAFTHFE DGKLYANKAF
IFAENVSNIS FTGKGTINGS GDSPEFNLGN DDTSISRSRP CMLLIIDSKH IKLNDLTLEN
SAYWLQNYLG CEFLELKGLK IYNQSNYNQD GMDIDAKHVL VEGCTLDVDD DGICFKSHDP
KRIVEDVVVR NCKISSNCNA IKFGTKSMAG LKNVSISNCN IQKASADPIR HWQKTLKFID
QPITVISGIA LEAVDGGIID SISIFNITMK DVQTPLFIVL GNRGNKPMGD KNFYNTSAGN
TAQQAVGKIS NIQLKNIKAT SHSKMASSIT AFPGHYIENI TLDNIAFNIM GAGTQQEAIT
PLIENPGAYP ENRMYGLAYP ASGFFIRHVK NLSLNHIKLS VRKPDYRSSI ILDDVLGVNI
SNVNLPVPEG NTAAIGLKNS KNIKVINPVF KSENQPLIQL DGTAEPEIAI AGFKKYKGWL
TSL