Gene Phep_3253 details


Gene Information

Locus tag: Phep_3253
Symbol:
ID: 8254372
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pedobacter heparinus DSM 2366
Kingdom: Bacteria
Replicon accession: NC_013061
Strand:
Start bp: 3859580
End bp: 3861151
Gene Length: 1572 bp
Protein Length: 523 aa
Translation table: 11
GC content: 44%
IMG OID: 644936906
Product: glycosyl hydrolase BNR repeat-containing protein
Protein accession: YP_003093510
Protein GI: 255533138
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG4409] Neuraminidase (sialidase)
TIGRFAM ID:
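
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length; the function names are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length(3859580, 3861151)
print(length, protein_length(length))  # 1572 523
```

Both results agree with the Gene Length (1572 bp) and Protein Length (523 aa) fields listed in the record.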


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.962324
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.018694
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATAACA GAAAAATATT TAACCATTTT ATCCTGTTCA TCTTTATAGC AGCTGCATTA 
GCTTGTTCAG ATAAAATTTA CAAAGCCCGG GATAGGGGCG GCATAAGCAT TAAAGCCGAA
CCAACCATTA ATCCTATCTT TAAAAGACTG GAAGTCAATC CGTATCTGCG TCTGGAAATT
GACATCCCAG AAGGTACTGA GGCGATAACA TTCCAGCAAT TTAAAGGCAG TATAGAAACA
GCAGGATTAC AAGATATCGC AAAGCTGGAG CTTTACCAGG GAGATGATCA GCAGGAACTT
TCAAAAAATA AATTGCTGGG GAGTACAATA CCCTCAACAA ATCAGTTTAG CATTACGCTT
GGCACGACTT TAACACCAGG AAAACACAGC TTATGGCTAA GTGTAACCTT AAAAGACAAT
GCTGATATTG ATCATCAGCT GCGGATCAGA GCTGATCAAT TGACCAACGC ATCAGGCCTG
ATCTATAAAG TGGCACAACA CCAGATCAGT TCAAGATACC TGGGCATCGC CCTGCGCAAA
CCCAATGACG AAAATGTACA CACTTACCGC ATCCCCGGTA TGATCACCAC AGACAAAGGA
ACTTTAATTT CCGTTTACGA TATCCGTTAT GACAATGACA AAGACCTGCC GGGCAATATT
GACGTAGGAA TGAGTAGAAG TACCGACGGA GGTAAAACCT GGGATACCAT GAAAAATATT
ATGGATATGG GTGGACCGGC AGATAACAGT GGTTCCGGCG ATCCTTCAAT CTTATTTGAC
CCTGTCACTA AAACCATATG GGTTTCAGCC CTATGGAGTA AAGGTAACCG CTCTATTGCA
GGCTCAGGAC CCGGTTTAAG TCCTGAAGAA ACCGGGCAGT TCCTGGTTAC CAGCAGTAAG
GATGACGGAT TAACCTGGAC CAAACCCTAC AGCATCACTA ACCAGGTTAA AAATCCGGAA
TGGCGCTTGT TTTTCCCTGG TCCGGGTAAT GGAATTGCCA TGGCAGACGG GAAAATTGTT
TTCCCGGCAC AATACTGGGA TGCCGCAAAA ATGCCGCATT CCACCTTAAT CTATAGCGAT
GACCATGGTA AAAGCTGGAA AGCAGGGCTT GGTGCAAAGT CAAATACCAC AGAGGCCCAG
CTTGTAGAAA CAAACCCGGG AACTTTAATG CTGAACATGC GGGACAACAG GGGTGGGTTC
AGGAGCGTAG CTACCACAAA AGATATGGGA CAAAGCTGGA TCGAACATGC AACGTCCTAT
AGTGCCTTAC CCGACCCGGT TTGTATGGCC AGTTTAATAA AAGTCAATGT AAAATTTAAG
CGCGTATCAA AGGATGTCCT GTTTTTCAGC AATTTGAATA TTTCAACGCC TCCCAGGGCA
CATACTACCA TTAAAGCTAG TCTGGATTTA GGAGAGTCCT GGCAACCTGT AAATCTATTG
CACCTTGATG AACGTAAATC TTACGGCTAT TCCGTACTTA CTAAAATAGA TGACCAGACC
CTGGGTTTGC TATATGAAGG CATCAGGACT TTGCTGTTTG TTAAAATTCC CGTAAAGGAT
ATCATTAAAT AA
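
The gene sequence is printed in space-separated blocks of ten bases, so it needs to be joined before any analysis. A minimal sketch follows, assuming the text has been copied into a Python string; `clean_sequence` and `gc_fraction` are hypothetical helper names. It removes the layout whitespace and computes the fraction of G and C bases, which for the full sequence can be compared against the GC content field of the record.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First displayed line of the gene sequence only; paste all lines for the full gene.
first_line = "ATGAATAACA GAAAAATATT TAACCATTTT ATCCTGTTCA TCTTTATAGC AGCTGCATTA"
seq = clean_sequence(first_line)
print(len(seq), f"{gc_fraction(seq):.0%}")
```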
 
Protein sequence
MNNRKIFNHF ILFIFIAAAL ACSDKIYKAR DRGGISIKAE PTINPIFKRL EVNPYLRLEI 
DIPEGTEAIT FQQFKGSIET AGLQDIAKLE LYQGDDQQEL SKNKLLGSTI PSTNQFSITL
GTTLTPGKHS LWLSVTLKDN ADIDHQLRIR ADQLTNASGL IYKVAQHQIS SRYLGIALRK
PNDENVHTYR IPGMITTDKG TLISVYDIRY DNDKDLPGNI DVGMSRSTDG GKTWDTMKNI
MDMGGPADNS GSGDPSILFD PVTKTIWVSA LWSKGNRSIA GSGPGLSPEE TGQFLVTSSK
DDGLTWTKPY SITNQVKNPE WRLFFPGPGN GIAMADGKIV FPAQYWDAAK MPHSTLIYSD
DHGKSWKAGL GAKSNTTEAQ LVETNPGTLM LNMRDNRGGF RSVATTKDMG QSWIEHATSY
SALPDPVCMA SLIKVNVKFK RVSKDVLFFS NLNISTPPRA HTTIKASLDL GESWQPVNLL
HLDERKSYGY SVLTKIDDQT LGLLYEGIRT LLFVKIPVKD IIK
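
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below is a hand-rolled illustration, not the tool used to generate this record. It relies on the fact that NCBI translation table 11 (the table listed for this gene) assigns the same amino acids to codons as the standard code and differs only in which codons may act as starts. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon order (first base varies slowest), one row per first base.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"  # T..
    "LLLLPPPPHHQQRRRR"  # C..
    "IIIMTTTTNNKKSSRR"  # A..
    "VVVVAAAADDEEGGGG"  # G..
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGAATAACA GAAAAATATT TAACCATTTT"))  # MNNRKIFNHF
print(translate("ATCATTAAAT AA"))                     # IIK
```

The two calls use the first thirty bases and the last twelve bases of the gene sequence; their output matches the first ten and last three residues of the protein sequence shown above.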