Gene Phep_2779 details


Gene Information

Locus tag: Phep_2779
Symbol:
ID: 8253887
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pedobacter heparinus DSM 2366
Kingdom: Bacteria
Replicon accession: NC_013061
Strand:
Start bp: 3281812
End bp: 3282948
Gene Length: 1137 bp
Protein Length: 378 aa
Translation table: 11
GC content: 47%
IMG OID: 644936425
Product: exported exo-alpha-sialidase
Protein accession: YP_003093040
Protein GI: 255532668
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG4692] Predicted neuraminidase (sialidase)
TIGRFAM ID:
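
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it agrees with the listed gene length) and that the unspliced CDS ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(3281812, 3282948)
print(length)                     # 1137
print(protein_length_aa(length))  # 378
```

Both values match the Gene Length and Protein Length fields of this record.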


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 42
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTAAAAA GAAGCACAGG TATAATCGGT TTACTGATTT GTTTAACCGG AGGATATGGT 
TTAAAGGCAC AATCGCAAAA ATGGCGGTCA GGTATTATAA CAGACGAATT CTTATATGAG
AAGGCAGCAT TTCCTTCCTG CCATTCTGCT ACCATAGCCG AAACGCCTAC CGGATTGGTG
GCAGCCTATT TTGGTGGTAC CCATGAACGC CATCCGGATG TGGAGATCTA TGTGAGCAGG
CAGGTTAACG GAACCTGGCT TGCTCCGGTT TCTGTAGCCA ATGGTATACA AAACGACAAA
GTAAGGCTGC CTACCTGGAA CCCTGTATTG TACCAGGTAC CTGGTGGAGA ACTGTTGCTT
TTTTACAAAA TTGGCCCTAA GCCATCCGAA TGGTGGGGCA TGATGCGCAG CTCAAAAGAT
GGCGGCATTA CCTGGTCTGA AGCGCAGAAA TTACCGGAAG GCCAGATTGG CCCAGTAAAA
AACAAACCGG TGCTGCTCAG CAATGGTAAC TTGTTCTGTC CTTCCAGTAC AGAAGGCAAA
GGCTGGAAAG TCCATTTCGA AGTAACCAAA GACAATGGCA AAACCTGGCG CTTAATCGGC
CCGCTGGAAG GTGGGGAGAT CAATGCTATA CAGCCAAGTA TCCTGGATCA TGGCAATGGA
AAACTACAGA TCCTGGCCAG GAGCAGGAAC AGGGCAATTG TAGAATCCTG GTCGCAGGAC
AACGGTGAAA CCTGGTCTGC TTTAGCAAAA ACGTCCCTGC CAAACAACAA TTCAGGCACC
GATGCAGTAA CTATGAAAGA TGGCAGACAT GTATTGGTAT ACAACCATGT ACTGCCTCCC
GGAGACCTGG CAAAAGGGGC CCGGACGCCA TTAAATGTAG CGATTTCCAA AGATGGTAAA
AACTGGTCGG CAGCGTTGAT CCTTGAGGAT TCGCCCACCA GCCAGTATTC CTATCCTGCG
GTAATCCAAA CCTCAGATGG TTTGCTGCAT TTCATTTATA CCTGGAGAAG GGAAAAGATC
AAACATGTAG TAGTTGATCC ATCAAAACTT AAGCTTAAAA AGATAGTAAA TGGCATTTGG
CCAAAATTAA AGGGCTATAC AGCCCCTGTG GTTACTGATG TTAAAAACGA GGAATAG
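
The GC content field of a record like this can be recomputed from the gene sequence. The sketch below is one straightforward way to do so; it strips the whitespace used to group the sequence into blocks of ten. The helper name gc_percent is hypothetical, and the record does not state how its own figure was calculated or rounded.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence.

    Whitespace (block separators, line breaks) is ignored.
    """
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc / len(cleaned)


# Only the first row of the gene sequence is used here to keep the example
# short; paste the full sequence to compare against the reported GC content.
first_row = "ATGTTAAAAA GAAGCACAGG TATAATCGGT TTACTGATTT GTTTAACCGG AGGATATGGT"
print(round(gc_percent(first_row), 1))
```

A single row is not expected to reproduce the whole-gene figure, since base composition varies along the sequence.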
 
Protein sequence
MLKRSTGIIG LLICLTGGYG LKAQSQKWRS GIITDEFLYE KAAFPSCHSA TIAETPTGLV 
AAYFGGTHER HPDVEIYVSR QVNGTWLAPV SVANGIQNDK VRLPTWNPVL YQVPGGELLL
FYKIGPKPSE WWGMMRSSKD GGITWSEAQK LPEGQIGPVK NKPVLLSNGN LFCPSSTEGK
GWKVHFEVTK DNGKTWRLIG PLEGGEINAI QPSILDHGNG KLQILARSRN RAIVESWSQD
NGETWSALAK TSLPNNNSGT DAVTMKDGRH VLVYNHVLPP GDLAKGARTP LNVAISKDGK
NWSAALILED SPTSQYSYPA VIQTSDGLLH FIYTWRREKI KHVVVDPSKL KLKKIVNGIW
PKLKGYTAPV VTDVKNEE
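
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this on the first row of the gene sequence. It builds the codon table from the standard genetic code, whose amino acid assignments translation table 11 shares (table 11 differs in the set of permitted start codons). The function name translate and the choice to stop at the first stop codon are assumptions made for this example.

```python
from itertools import product

# Amino acids in NCBI order for codons enumerated over the bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cleaned = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(cleaned) - len(cleaned) % 3, 3):
        aa = CODON_TABLE[cleaned[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


first_row = "ATGTTAAAAA GAAGCACAGG TATAATCGGT TTACTGATTT GTTTAACCGG AGGATATGGT"
print(translate(first_row))  # MLKRSTGIIGLLICLTGGYG
```

The output matches the first twenty residues of the protein sequence listed above.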