Gene Phep_2841 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_2841 
Symbol 
ID8253949 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp3385601 
End bp3387022 
Gene Length1422 bp 
Protein Length473 aa 
Translation table11 
GC content41% 
IMG OID644936487 
ProductAlpha-L-fucosidase 
Protein accessionYP_003093102 
Protein GI255532730 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3669] Alpha-L-fucosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.252312 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.00495105 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAACGAA TTGGCATTGT TTTTTTCTTA TTAGTGATGA TATCATGTTC ACTAAGTGCA 
CAAGATAGTA AAATGGAGTG GTGGCGCGAA GCCCGTTTTG GAATGTTCGT TCATTTTGGT
GTGTACGCCC AGTTTGGGGG TGTTTACCGG GGTCATGAAC AAAAAGTTAA AAATGGAGAA
TGGTTGATGA ACCGGATGAA AGTCCCCGTT CAGGAATATA AAGATACAGC TGGACATTTT
AATCCCATAA ATTTCAATGC TGATGAATGG GCACGTATGG CCAAAGACGC AGGCATGAAA
TATTTGGTTA TTACGGCAAA GCACCATGAT GGATTTGCAT TGTTTGACAC CAAAGCCAGC
GATTGGAATA TTATGAAAGC ATCACCTTAT GGTAAAGATA TGATTAAACC TTTAGCTGAT
GCTTGCAGAA AGTATGGTTT AAAATTTGGT ATTTATTATT CACAATCACA AGATTGGGGA
AATCCTGGTG GTTTTACTGG TCGCAGGGTC ATGAAACAAG GATGGGATAA TCCGGATAGC
ACGCGTATCG ATGCCTATAC TTTAGCGCAC AATGGCAGTT GGGACCCCAT ACAGCAATCA
AAAACTTTCA ACGAGTATTT AAACGGTGTA GCTTTTCCAC AGGTAAAAGA GTTATTGTCT
AACTATGGCG ATGTTTCCGT ATTGTGGTGG GATACACCTG GAGGATTATC TGCTGAACAG
GCCAGACAAA TGATGACTAT GGTGAAGCAC TTGCAGCCTA ATATCATTAC CAATGACAGG
CTGGGAGGAA ATTTGCCAGG TGATTTTAAA ACTCCTGAAC AAAAAATACC CAATCTGGCA
GAGCTTGATG GTAAAGATTG GGAAACCTGT ATGACCATGA ACCGCACCTG GGGCTATCGC
ACAGCCGATC ATGAATGGAA GTCTTCTTCA GAACTGATCC AGAAATTGAT AGATATTGCT
GCTAAAGGGG GCAATTACCT CCTCAATATC GGTCCTAAAC CAGATGGTAC TTTCCCCGTT
GAAAGTGTTG AACGTTTAAA AGAAATAGGT AACTGGATGA GCAAATATGG TGAAGCTATT
TATGGCACCC ATGCTAACCC TGTTGAGCCT GTAGATTGGG GACGCATTAC TGCCAAAGAT
GAACAACATG GTACTGTACT TTATTTGTCC GTTTTCAATT GGCCCGGCAG TGGTCAGCTA
AACATTGATG GGCTGGGTAA TAAGGCAATT GGTGCCACAT TATTAAACAG CAATGCAAAG
CTGAAACTTA AACAGGATGA GCAGGGTATT TTAGAAATTA GTGGTCTGCC TGTCAGTGCA
CCTGATAAAA CAGCAAGTGT AATTGCTTTG AAGCTGAATG GTTTTGCAAA AAAGAAAGAT
TTTAATCCGG AAAAGAAAAT GAAATCGGGT TCAATAGATT AA
 
Protein sequence
MKRIGIVFFL LVMISCSLSA QDSKMEWWRE ARFGMFVHFG VYAQFGGVYR GHEQKVKNGE 
WLMNRMKVPV QEYKDTAGHF NPINFNADEW ARMAKDAGMK YLVITAKHHD GFALFDTKAS
DWNIMKASPY GKDMIKPLAD ACRKYGLKFG IYYSQSQDWG NPGGFTGRRV MKQGWDNPDS
TRIDAYTLAH NGSWDPIQQS KTFNEYLNGV AFPQVKELLS NYGDVSVLWW DTPGGLSAEQ
ARQMMTMVKH LQPNIITNDR LGGNLPGDFK TPEQKIPNLA ELDGKDWETC MTMNRTWGYR
TADHEWKSSS ELIQKLIDIA AKGGNYLLNI GPKPDGTFPV ESVERLKEIG NWMSKYGEAI
YGTHANPVEP VDWGRITAKD EQHGTVLYLS VFNWPGSGQL NIDGLGNKAI GATLLNSNAK
LKLKQDEQGI LEISGLPVSA PDKTASVIAL KLNGFAKKKD FNPEKKMKSG SID