Gene Phep_2839 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_2839 
Symbol 
ID8253947 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp3381376 
End bp3382590 
Gene Length1215 bp 
Protein Length404 aa 
Translation table11 
GC content40% 
IMG OID644936485 
Productglycosyl hydrolase family 88 
Protein accessionYP_003093100 
Protein GI255532728 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.00286392 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGAATAAAA AACCGATAAT AAACAAATTG ATAGTGGTTG CTTGTGCTGG TTTGGCATTA 
GGTTTAACCC TGGCTTGTGC AAAACCAACT GCTCAACTTA CAGATGAGAA AATTGCTTCG
ATTTTTGAAT TGGAGAAGAT ATATGCCGAC AATGCCTTTA AAATCGTAAA ATCGACCGGT
AAAATGCCGC GTTCATTAGA GAAAGGTTTT CAACCTATTT CTGACTGGAC CAGCGGCTTT
TATCCTGGAA ATTTGTGGCT GGTGTATGAA TTTACCAGGG ACAAAGACAT CCTTAAAAAA
GCCGAATATG CTACGGCCCT GGTAGAGGAC AATAAGGACT ATACGCATGA CCATGATATT
GGTTTCATGA TTTATAGCAG TTATGGCAAT GGGTACCGGC TTACTAAAAA TGAAAAATAC
AAAGCAGTGA TGATTGAGGC CTCTAAATCG GCCATTAAAA GGTATAATCC TAAAGTGAAA
TCAATTATGT CCTGGAATCC CAGTGCAGCA CGGGACTGGA AATTTCCGGT AATTATCGAC
AATATGATAA ATCTGGAACT GTTACTGAAT GGGACCGGAT TTACCGGTGA CAGTACTTAT
TACAATGTAG CGGTTAATCA TGCCAATACG ACCATGAAAA ACCAGTACAG GTCAGACTAC
AGTTGTTCGC ATGTCGTAGA TTATGACCCG CTTACAGGAA AAATGCGTAA ACGTGACTGG
AACAATGGAG ATAGCAACCC TGAAACCGCG TCCTGGAGCC GTGGCCAGAG CTGGGGGTTG
TATGGGTTTG GATTTATGTA TAAATACACA CATAAAAAAG AGTATTTAAC CCAGGCAGAA
AATATAGCTG CATATATTCT AAACAATCAG AATATGCCTG AAGATATGGT ACCTTACTGG
GATTATCGCG CACCGAAAAT CCCTACCCAG AAAGATGCCT CGGCAGCTGC ATTGCTTGCA
TCAGGCTTAA TGCAGCTTGC AGAATTGTCA CCGGCCAATG GCAAAAAGTA CTTTAAAGCG
GGCGAAAAAA TTCTGGAAAG CCTTTCTGCC GCACCTTACC TGAACGAGAG TGGCAAAGGC
AATTATCTAC TGAACCATGC TACCGGTAAT TTCTTAAGAA AATCTGAAGT AGATGGCGGT
CTGATATATG CAGATTATTA TTATTTAGAA GCTTTATTGA GGTATCAGAA ATTGAAGCAA
AAAATTGAAT TTTAA
 
Protein sequence
MNKKPIINKL IVVACAGLAL GLTLACAKPT AQLTDEKIAS IFELEKIYAD NAFKIVKSTG 
KMPRSLEKGF QPISDWTSGF YPGNLWLVYE FTRDKDILKK AEYATALVED NKDYTHDHDI
GFMIYSSYGN GYRLTKNEKY KAVMIEASKS AIKRYNPKVK SIMSWNPSAA RDWKFPVIID
NMINLELLLN GTGFTGDSTY YNVAVNHANT TMKNQYRSDY SCSHVVDYDP LTGKMRKRDW
NNGDSNPETA SWSRGQSWGL YGFGFMYKYT HKKEYLTQAE NIAAYILNNQ NMPEDMVPYW
DYRAPKIPTQ KDASAAALLA SGLMQLAELS PANGKKYFKA GEKILESLSA APYLNESGKG
NYLLNHATGN FLRKSEVDGG LIYADYYYLE ALLRYQKLKQ KIEF