Gene Phep_3963 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_3963 
Symbol 
ID8255097 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp4769321 
End bp4770508 
Gene Length1188 bp 
Protein Length395 aa 
Translation table11 
GC content41% 
IMG OID644937627 
Productglycosyl hydrolase family 88 
Protein accessionYP_003094216 
Protein GI255533844 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0599446 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTTTCAC CCCTGACCAA AAGAATGATG AAAACTATTT TTTTAGCGGT TACGCTTTTA 
ATAACGACAC TCAGGTTACA GGCACAACCT AAGGTTGATG TAAATAAAGA AATCGAATTT
GCCAAAGCAC AATATGAATT GATGCTAAAA GCGAATACAG ACCTCAGCAG GTTCCCTCAG
TCTGTAAAAC AAGACGGGAC ATTGGATACC CGGACATCAG ATTGGTGGTG CAGTGGTTTC
TTCGGCGGGT CATTATGGTA TCTTTATGAA TTTACCAGCG ACGATAAATG GAAAGTGGCT
GCGGATAAAT GGACCATGGC TGTAGAAAAA GAAAAATACA ATAAGACCAC ACACGATTTG
GGGTTTATGT TGTATTGTTC CTTCGGAAAT GGTTATCGTT TAACCAACAA TGAGCAATAC
AAGGACATTA TGCTTGTTGG GGCAGAATCG CTGGCTACGC GCTTCAATCC TAAAATTGGG
CTGATCAAGT CCTGGGAGGA GTTTAAAGGC TTTGACTATC CTGTGATCAT AGACAATATG
ATGAATCTCG AATTCTTATT GTGGGCAGTT AAAGCATCAG GAAACCGCAA ATTTCATGAT
ATAAGTATTA CACATGCTGA CAATACCCTT AAAAACCATT TTAGAAAAGA TTATAGCAGC
TATCATGTAG TTTGTTATGA TACTGCAGGA AAAGTACTTG CCAGGAAAAC CAATCAGGGG
GCTGCAGACG AATCTGCCTG GGCCAGAGGG CAGGCATGGG CTGTTTACGG ATATACCATG
ATGTATAGGG AAACTGGAAA TAAGAAGTAT CTTAACCAGG CTATAAATAT TGCAAAGTTT
ATTGCCAGTC ATCCGAACCT TCCATCGGAT AAGATCCCCT ATTGGGATTT TAACGCACCG
GATATACCGA ATGAAGAAAG AGACGCGTCA GCAGCGGCCA TTACGGCTTC TGCTTTATTG
GAATTGTACA CCTATACCAA CGATAAGGCA CAGTTCAGAC TGGCAGAAGA CATGTTGGCA
AGTCTTTCGG GTAAAGTATA CACTGCAAGT CCGGGCAAAA ACCACAATTT CCTGCTCAAA
CATTCGGTAG GTTCAAAGCC CTATAAATCA GAGGTAAATA CACCCATAAT TTACGCAGAT
TATTATTACC TGGAGGCATT ATTAAGGTAT AGCAAACTAT TGAAATAA
 
Protein sequence
MLSPLTKRMM KTIFLAVTLL ITTLRLQAQP KVDVNKEIEF AKAQYELMLK ANTDLSRFPQ 
SVKQDGTLDT RTSDWWCSGF FGGSLWYLYE FTSDDKWKVA ADKWTMAVEK EKYNKTTHDL
GFMLYCSFGN GYRLTNNEQY KDIMLVGAES LATRFNPKIG LIKSWEEFKG FDYPVIIDNM
MNLEFLLWAV KASGNRKFHD ISITHADNTL KNHFRKDYSS YHVVCYDTAG KVLARKTNQG
AADESAWARG QAWAVYGYTM MYRETGNKKY LNQAINIAKF IASHPNLPSD KIPYWDFNAP
DIPNEERDAS AAAITASALL ELYTYTNDKA QFRLAEDMLA SLSGKVYTAS PGKNHNFLLK
HSVGSKPYKS EVNTPIIYAD YYYLEALLRY SKLLK