Gene Phep_1581 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_1581 
Symbol 
ID8252683 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp1868983 
End bp1870452 
Gene Length1470 bp 
Protein Length489 aa 
Translation table11 
GC content36% 
IMG OID644935235 
Productglycoside hydrolase family 28 
Protein accessionYP_003091856 
Protein GI255531484 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG5434] Endopolygalacturonase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.0871574 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAATA TACGTCATAT ATTAACCGTT ATAATCCTGT TATTTTCAAT TCCTGCTTTT 
TCTGAAAATA TCTATAACGC ATCGTTGTTT GGAATAAAAT CCAATGGCTC TACGATGAAT
ACAAACTCTA TTCAAAAAGC AGTAAACTAT ATTAGTGAAA AAGGGGGCGG AACACTGCGG
TTCTATGTTG GCAGATACCT GACCGGTACC ATTCAGCTCA AATCGAATGT AACTATCCTT
TTAGAAGAAG GAGCGATTAT CGTGGGTTCC ACAAATATTT ACGACTACAA TATTGATGTA
CCCAACCCGG CTTTAATATA TGCTAAAGGG GTAAATAACA TTGGTATTAA AGGTAAGGGT
GTTATTGAAG GACAGGGGCG GGTGGTAGCT TATGACCTGC TTGACCAAAT TCATAAAGGA
CTTATTGTAG ATGACATTAA AAATGACCGT CCGACCAATC GTCGTCCTAA AGCCATTTAT
TTTAGAGAAT GTAATCAGGT AAATATTGAT GGGATTAACA TTTGGAATGC AGCCGACTGG
GTGCAGGTAT ATGATCAATG TCAACAAGTA TTAATTAATA ACATTACCGT TAAAAGCAAC
GAATTCTGGA ACAATGACGG TATTGATATT GTGGATTGTA AGGACTTTAA ATTGTTAAAT
TCTTTTATCG ATGCAGCGGA CGATGCCATC TGTTTAAAAT CACATGATAG AACAAAGCGA
TGTGAAAATA TAGAGATCCG TAATTGTGTT GCCCGCTCAA GTGCCAATGG GATCAAATTC
GGAACGGTAT CAGCGGGTGG TTACAAAAAT GTGAAAATTA TCAATAATAA GGTTTACAAT
ACGTTTAGAT CAGCAATTAC AATTGCTTGT CCTGATGGAG GAGTTGCCGA AGATATTTTA
ATAGATAGCT TGTATGCTTA TAATACTGGA AATCCTATTT ACTTGCGTAT GGGCTCCAGG
TGGAATAACC AGCGGATAGG AGGAATGAAA AATGTAACCA TTCAAAATTT GTATGCTCAA
ATTACGGCAG ATAAACCTGA TAGTGGTTAT ATTTATGAGG GCCCCATTGA AGATAACCCC
AGAAATATCT CTCCTTCGAG TATAGTGGGT TTAAAAAATA TGATTATTGA AAATATTAAA
TTGAAGAATG TTGAAATCGT ATATCCAGGT GGAGGAAATC CAAATTATGC TTTCAGAGGA
ACTACAAAAG AGGATTTGGC TGGTATCCCT GAAATGAAGG ATGCTTATCC TGAATTTTCG
CAGTTTAAAG AACTTCCGGC ATGGGGTTTC TTCATTAAAT ATGCTAAGAA TATTTCTTTC
GAGAATGTAA AGTTAATTGC CTTGAATGCT GATTATCGCC CTTCGGTGGT TTTGGATCAC
GTTAATGGAT ACACCTTTGA TAAATTGGAT ATTAAAGAAA AGGGTAAACG GAGCAAGAAA
CAAATCATAG TGAATGATTC TAGTAAATAG
 
Protein sequence
MKNIRHILTV IILLFSIPAF SENIYNASLF GIKSNGSTMN TNSIQKAVNY ISEKGGGTLR 
FYVGRYLTGT IQLKSNVTIL LEEGAIIVGS TNIYDYNIDV PNPALIYAKG VNNIGIKGKG
VIEGQGRVVA YDLLDQIHKG LIVDDIKNDR PTNRRPKAIY FRECNQVNID GINIWNAADW
VQVYDQCQQV LINNITVKSN EFWNNDGIDI VDCKDFKLLN SFIDAADDAI CLKSHDRTKR
CENIEIRNCV ARSSANGIKF GTVSAGGYKN VKIINNKVYN TFRSAITIAC PDGGVAEDIL
IDSLYAYNTG NPIYLRMGSR WNNQRIGGMK NVTIQNLYAQ ITADKPDSGY IYEGPIEDNP
RNISPSSIVG LKNMIIENIK LKNVEIVYPG GGNPNYAFRG TTKEDLAGIP EMKDAYPEFS
QFKELPAWGF FIKYAKNISF ENVKLIALNA DYRPSVVLDH VNGYTFDKLD IKEKGKRSKK
QIIVNDSSK