Gene Phep_2223 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_2223 
Symbol 
ID8253329 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp2568474 
End bp2569673 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content44% 
IMG OID644935872 
ProductExo-alpha-sialidase 
Protein accessionYP_003092489 
Protein GI255532117 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTAATTT TTACCAGAGC ACACCACAAA CTGTTTGCAC TCGAAACCGC TCTATTTTTC 
ATCACTATTT GTTTTTTTTT GACATTTATA ACAACAGGAT GCAAAACAAC AAAACAGAAC
ATTTCAAGGT CAGGATTAGA CAGTACCTTG GTTTTTACAC CCGATAAAAC TTATACTTCT
ATGCGGATAC CGGCTTTGGT GATTACCCAA AAAGGCACTT TACTGGCATT TTGTGAAGGC
AGGATAGGAA CAGCAAGCGA CTGGGCAGAT ATGGATCTGC TGATGCGCAG GAGTACAAAT
GGTGGAAAAA CCTGGGAACC CCATGTCATT ATTGCCGCAA AAAAAACCGG AGAACCTACA
AGTAATGCCA CTCCAATAGT AGATAAAGAT GGAACCATCC ATTTGCTATA CCAGCGGGAT
TATGCGAGGG CCTACTATAC CTTTTCAAAA GATGACGGAA AAACATGGAG CAAGGCTGCA
GACATTACCT ATGCCTTTGA TGCTTTTAAA CCGGAGTATG ACTGGAAAGT ACTTGCACCA
GGACCAGGAC ATAGTATCCA GTTAAAAAAT GGCCGTTTGC TGGTTCCTGT ATGGTTAAGT
AATCCGGCTA AAATGCTGCC CAGAAGAAGT CACGCGCCAT CTTGTATTGC TACCATTTAT
AGTGACGACC TGGGCCACAC CTGGAAAAGG GGAGCAATTA TAGCAGACAA CAATCCCGAT
TTTAAGAACC CCAGCGAAAC TATGGCCATC CAACTTAAAG ATGGACGTGT GATGGTCAAT
ATCAGAAACG TAACAGAAAA GCACCGCAGG GGCCTTAGCT ATAGTAAAGA CGGGATCAGC
GGGTGGAGCA AACCTGTTTT CGACGAAGAA TTGTTTGAAC CTGTATGCAT GGCTACCATT
ACGCGCCTGC CGGAAAAGCT GGGAGGGGGA ATGCTGTTTA TCAATCCCGA CAGCAGGGAC
ATTCCTAAAT ATCCGCGCAA AAACCTCACT GCCAGGATCA GCAATGATGA AGGGCAAAGC
TGGCCGGTTA AAAAAGTAAT CGATACAGGA ACCTCAGGCT ATTCTGATGT AGCTGTTGGA
GCAGACGGAA CCATATATTG TTTATACGAG ACCAACTCAA ACCCCGGCAG AAATTTTAAT
TACAGCCTGG TTTTGAAACG TTTCAGTTTA AACTGGCTAA CCGGCACATC TAAGAAATAA
 
Protein sequence
MLIFTRAHHK LFALETALFF ITICFFLTFI TTGCKTTKQN ISRSGLDSTL VFTPDKTYTS 
MRIPALVITQ KGTLLAFCEG RIGTASDWAD MDLLMRRSTN GGKTWEPHVI IAAKKTGEPT
SNATPIVDKD GTIHLLYQRD YARAYYTFSK DDGKTWSKAA DITYAFDAFK PEYDWKVLAP
GPGHSIQLKN GRLLVPVWLS NPAKMLPRRS HAPSCIATIY SDDLGHTWKR GAIIADNNPD
FKNPSETMAI QLKDGRVMVN IRNVTEKHRR GLSYSKDGIS GWSKPVFDEE LFEPVCMATI
TRLPEKLGGG MLFINPDSRD IPKYPRKNLT ARISNDEGQS WPVKKVIDTG TSGYSDVAVG
ADGTIYCLYE TNSNPGRNFN YSLVLKRFSL NWLTGTSKK