Gene Phep_2845 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_2845 
Symbol 
ID8253953 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp3393846 
End bp3395048 
Gene Length1203 bp 
Protein Length400 aa 
Translation table11 
GC content45% 
IMG OID644936491 
Productglycosyl hydrolase family 88 
Protein accessionYP_003093106 
Protein GI255532734 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.0713509 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAAG CTTCCAGCAT GCTTTTAGCT GCTTCTACCC TTGCTTTAAC CTGCTTTGCC 
CAGCCCAGGC AGGTTAACCC AGCATTCAAC TGGTTTAAAA ACGCTACGAA AACAATAGAC
TACCAGCTCA ATAAAGCAGC AAACACCTAT AAACCTGGTC AAAATCCCCG TTCTGTAAAT
CCTGATGGCA AAGTGAGGCT GGCTGGCCTT ACCGATTGGA CTACAGGTTT TTTTCCGGGA
TCACTCTGGT ATGGATATGA ACTTACCGGA GATCAAAAAC TGGCAGACAA AGCAAAACGT
TTTACCCTGG CACTTGATTC TATCCGCCAC ATTAAAAATA CCCACGATTT AGGCTTCATG
TTGTACTGCT CCTACGGCAA TGCCTATCGC ATTACAGGTG ATAAAACCTA TCTGCCAGTG
CTCACTGAGA GTGCCCAGCA TCTTTACGAA AGATTTAACC CTAAAGTGGG TGTCATCCGT
TCCTGGGATT TTGCGCCATG GCACTATCCA GTAATTATCG ATAACATGAT GAACCTGGAA
TACCTGTATT GGGCAGCAAA CACCTTTAAC AAACCCGCAT ATGCCCAGGC AGCCAATACC
CATGCCATTA CCACCATGAA AAACCACTAT AGAAAAGACA ATAGTTCGTA TCACGTGGTA
GATTACGATC CTGCAACAGG ACAAGTATTG CGCAAGGTTA CCCACCAGGG TCTTACCGAC
GAGTCTGCAT GGGCCCGCGG ACAAGCCTGG GGGCTTTACG GCTACACAAT GTGTTATGCC
AATACCAAAA GCCAGACCTT CTTAGATCAG GCCGAACGCA TAGCCTCCTT TATCATGAAC
CATCCCCGCA TGCCAAAAGA CAAAGTTCCG GTATGGGATT TTGATGTGCA CAATGCCCTC
GATATAGATG AACGCGCACC AAGAGATGCC TCGGCAGCGG CTGTTATTGC GTCTGCTCTG
CTGGACCTGA GTACCCATGT AAAAGACGGA AAAAAATATG TTGATTATGC AGAAGACATA
CTCAGATCAC TCTCTTCCGA TGCTTACCTG GCCAAACCAG GCGAAAACAA GTTCTTCATC
TTAAAACACA GTGTTGGGGC TTTTTTGTAC AATTCTGAAA TTGACACCCC TATAGATTAT
GCCGACTATT ATTACCTGGA AGCCTTAAAA AGATATGCCA GCATTAAAAA AATAAACCTT
TAA
 
Protein sequence
MKKASSMLLA ASTLALTCFA QPRQVNPAFN WFKNATKTID YQLNKAANTY KPGQNPRSVN 
PDGKVRLAGL TDWTTGFFPG SLWYGYELTG DQKLADKAKR FTLALDSIRH IKNTHDLGFM
LYCSYGNAYR ITGDKTYLPV LTESAQHLYE RFNPKVGVIR SWDFAPWHYP VIIDNMMNLE
YLYWAANTFN KPAYAQAANT HAITTMKNHY RKDNSSYHVV DYDPATGQVL RKVTHQGLTD
ESAWARGQAW GLYGYTMCYA NTKSQTFLDQ AERIASFIMN HPRMPKDKVP VWDFDVHNAL
DIDERAPRDA SAAAVIASAL LDLSTHVKDG KKYVDYAEDI LRSLSSDAYL AKPGENKFFI
LKHSVGAFLY NSEIDTPIDY ADYYYLEALK RYASIKKINL