Gene Phep_2210 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_2210 
Symbol 
ID8253316 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp2546465 
End bp2547865 
Gene Length1401 bp 
Protein Length466 aa 
Translation table11 
GC content44% 
IMG OID644935859 
ProductSialate O-acetylesterase 
Protein accessionYP_003092476 
Protein GI255532104 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.32552 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACCAT CAAAATTTTT GCTGTTACTG TGTACAGCTT TACTTTTTTC TTTAAACACT 
TTTTCAAAAG TTACCCTGCC GGCCATCATA TCCAGCGGAA TGGTACTTCA GCAACAAAGC
AATGTAGCAC TTTGGGGCAA GGCAAAAGCT GGCTCAAAGG TCACCATTAC CACTTCATGG
AACCAGAAAG CATACCTCGC CACAACCGCA AAAGATGGTT CCTGGAAACT TTCTGTTCCA
ACACCAAAAG CAGGCGGCCC GTATACCATT ACATTTAATG ACGGAGAAAA ACTGGTCCTG
AATGATATCC TGATCGGTGA AGTATGGCTT TGTTCGGGAC AATCCAATAT GGAAATGCCT
GTAAAAGGCT TTGGCAATCA ACCCATCACC AATTCTAACG AATTGTTAAG TGATGCCGAT
GAACCGGGAG TAAGGTTGTT CAGAATTGAA AAAAATATGT CCAGAACCCC GCTAACGGAG
TTAAATGCCA AATGGGAGCA CAGCAATTCC GAAACCACCG GACAATTCAG CGCAGTGGGT
TACCAGTTTG CACGCATGTT ACAGCAAAAG TTAAAGGTTC CGGTAGGCAT TATCCAGTCT
GCTTATGGGG GTACCATCAT CGAAGCCTGG ATGGACAAAA AGAGCTTTGC CGGTTTTACA
GATGTTAAAA TCCCGGCTGA TACAGTTAAA ATGATCAAGA ATGAACCTTT TGTATTGTTC
AATGCCATGA TCAATCCAAT TGTTGGCTTC AATATCAAAG GTGCCCTTTG GTACCAGGGT
GAAAACAACT GGTTTACGCC CGATACCTAT GACAAAAAAA TGGAAGCCAT GGTAAAGGAG
TGGCGTTCGA TATGGGGATG TGGCGATTTT TCATTTTATT ACGTGCAACT GGCGCCAAAC
GCTTATCCGA ATGGCAAGGA TAAACTGCCG GTAATCTATG AAAAGCAGGC AAAAGCGATG
CAGCTGATCC CCAATTCAGG AATGGCAGTG AGTGTAGATG CAGGCAGCCA GACCACCATT
CATCCGCCCG ATAAAACAAT TATTAGCAAA CGACTGCTTT ACTGGGCATT GAATAAAACT
TATGGAAAAA AAGGGGTGGC GTATTCAGGC CCGGTATATC AATCTTTAAA GATCAGCGAC
AATAAAGCCA TTGTAAGTTT CAGCGAAATC CCTATTGGCC TTACAGCCTA TAACAAGCCG
CTCATTTCCT TTGAAATTGC AGGTGCAGAT CAAGTATTTC ATCCGGCCGC AGCAACCATT
TCCGGGAAAA CCGTAGTGGT ACAAAGTGAT GAAGTAAAAA GCCCCGTAGC AGTAAGATAT
GCTTTTAAAG ACAGAGCTGA GGGTAATTTA TACAATGCAG AAGGTTTACC TGCTGCACCG
TTCAGAACCG ACAGCTGGTA G
 
Protein sequence
MKPSKFLLLL CTALLFSLNT FSKVTLPAII SSGMVLQQQS NVALWGKAKA GSKVTITTSW 
NQKAYLATTA KDGSWKLSVP TPKAGGPYTI TFNDGEKLVL NDILIGEVWL CSGQSNMEMP
VKGFGNQPIT NSNELLSDAD EPGVRLFRIE KNMSRTPLTE LNAKWEHSNS ETTGQFSAVG
YQFARMLQQK LKVPVGIIQS AYGGTIIEAW MDKKSFAGFT DVKIPADTVK MIKNEPFVLF
NAMINPIVGF NIKGALWYQG ENNWFTPDTY DKKMEAMVKE WRSIWGCGDF SFYYVQLAPN
AYPNGKDKLP VIYEKQAKAM QLIPNSGMAV SVDAGSQTTI HPPDKTIISK RLLYWALNKT
YGKKGVAYSG PVYQSLKISD NKAIVSFSEI PIGLTAYNKP LISFEIAGAD QVFHPAAATI
SGKTVVVQSD EVKSPVAVRY AFKDRAEGNL YNAEGLPAAP FRTDSW