Gene Phep_1703 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_1703 
Symbol 
ID8252805 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp2017515 
End bp2018690 
Gene Length1176 bp 
Protein Length391 aa 
Translation table11 
GC content42% 
IMG OID644935355 
ProductExo-alpha-sialidase 
Protein accessionYP_003091976 
Protein GI255531604 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG4409] Neuraminidase (sialidase) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.137706 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.0157049 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTATACCT CAATCAAAAA ACCAAAAATG ATTGGCGGAG TGTTTTGTTT TCTGATGCTG 
ATCTTTTTTT GCAGGGACAT TGCTTCGGCA GCTTCTTTAA CATCTGGAAT TTCTCGTCTG
GATAGTTTGA ATTTCATTTA TAAAGCCGGA GAAAATGGCT ATTCCTGTTT TAGAATCCCA
GCCCTTATTT ATACAAAAGA TGGGACTCTG CTGGCTTTCG CAGAAGCCAG AAAAAATAAT
TGCGGTGATT CGGGTGATAT AGACCTCGTG ATCAAAAGAT CGTCGGACAG GGGCAAAACC
TGGAGTAGTT TACAGGTGGT ATGGAGTGAT TCGACCAATA CCTGTGGCAA TCCGGTCCCT
GTACAGGATA GGGGTACCAG CCGGATTTGG CTGATATCCA CATGGAACCT GGGGACATTT
CATGAAAAAC AGATCATCAG TGATGCCGCA AAAAAAGGAA GACATGTGTA TAAGCTTTAT
TCTGATGATG ATGGCAGGAG CTGGTCGGGG CCAACGGAAA TTACGGACCA GGTAAAGAAG
CCAGACTGGT CGTGGTATGC TACCGGACCT TGCCATGGGC TGCAAATAAC CAATGGAAAA
TATGCCGGGC GATTGGTGAT TCCCATAAAC CACATTGAAA GAGGTACCAA TCAAAATTTT
GCGCATATCA TCTACTCAGA TGATCATGGT AAAAGCTGGA ACCTGGGCAA CAATACCCCC
CAGGATAAAA TGAATGAAAC TACTGTCGCC GAAATTTCTA AGGGCCGTTT AATGCTGAAT
ATGAGAAATG CAGACCGGAG CATTAAAACC AGGCATACTG CTATTAGCAG CAATGGTGGA
TTGAGCTGGA ATAATGTTGA AAAGGATACG GTTTTAATAG AACCCATTTG TCAGGGTAGC
TTGTTGAGTC ACTTTTACAA TAAAAAGAAA CCCGTTTTGC TGTTTACCAA TCCTGCAAAT
GCTAAGCTGC GAGCTAATAT GACGTTAAGG ATGAGCCTGA ATGATGGGAA GACCTGGAAA
CACAATTTAG TTTTACATGC TGGCCCATCG GCTTATTCTG ACATCGCGCT TATAGATAAA
ACCACAATTG CGAGTTTTTT TGAGGCGGGG TATGAAAAGC CTTATGAAGG TATCGTATTT
AAAATCGTCA ATTATTCAGA TCTAATACAA AACTAA
 
Protein sequence
MYTSIKKPKM IGGVFCFLML IFFCRDIASA ASLTSGISRL DSLNFIYKAG ENGYSCFRIP 
ALIYTKDGTL LAFAEARKNN CGDSGDIDLV IKRSSDRGKT WSSLQVVWSD STNTCGNPVP
VQDRGTSRIW LISTWNLGTF HEKQIISDAA KKGRHVYKLY SDDDGRSWSG PTEITDQVKK
PDWSWYATGP CHGLQITNGK YAGRLVIPIN HIERGTNQNF AHIIYSDDHG KSWNLGNNTP
QDKMNETTVA EISKGRLMLN MRNADRSIKT RHTAISSNGG LSWNNVEKDT VLIEPICQGS
LLSHFYNKKK PVLLFTNPAN AKLRANMTLR MSLNDGKTWK HNLVLHAGPS AYSDIALIDK
TTIASFFEAG YEKPYEGIVF KIVNYSDLIQ N