Gene Phep_1614 details


Gene Information

Locus tag: Phep_1614
Symbol:
ID: 8252716
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pedobacter heparinus DSM 2366
Kingdom: Bacteria
Replicon accession: NC_013061
Strand:
Start bp: 1914500
End bp: 1916404
Gene Length: 1905 bp
Protein Length: 634 aa
Translation table: 11
GC content: 44%
IMG OID: 644935268
Product: Beta-N-acetylhexosaminidase
Protein accession: YP_003091889
Protein GI: 255531517
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3525] N-acetyl-beta-hexosaminidase
TIGRFAM ID:
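
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the gene ends with a single stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1914500
end_bp = 1916404
gene_length_bp = 1905
protein_length_aa = 634

# Assumption: coordinates are 1-based and inclusive.
computed_gene_length = end_bp - start_bp + 1

# Assumption: one codon per residue plus one untranslated stop codon.
computed_protein_length = computed_gene_length // 3 - 1

print(computed_gene_length, computed_protein_length)  # 1905 634
```

Under these assumptions both computed values agree with the listed Gene Length and Protein Length.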


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATGAAAA AACTATTAGC TATTGGTTTA CTGGCCCTTT TGTATGTTCC TGTTTTTGCC 
CAAAATGATG TAAATATGGG AATTATCCCT GCTCCGGTGT CTGTAAAAAA AGGTGCAGGA
ACTTTTAAAT TGGATAAGAC TGTAGTTTTG ATATCGAATG AGGTTAAAAA TTCGAAATCG
GCTGATCTTT TAAATGCTTT TATTGTAAGC AAGGCCGGCT TTTCTTTAAG AGAAGCCAAA
TCTGCGATGG CCAATGAGCG GGCCATTGTA CTGAGTTCTG CTGCTGCGGA ACAATTGCCT
GCCGAAGGTT ATAAAATAAG CATAAACCCT AAAACCATTA CCATTACCGG AAAGGCTGCG
GGCTTATTTT ATGGTGTACA ATCTGTTATG CAGCTGATGC CGGACAAACA GCACAATGAG
ATCCTGATTC CTGCAGCTGA AATTAACGAT TACCCAAGGT TTAAATACAG GGGCCTGCAC
CTGGATGTTG GAAGGCACGT GTTCCCTGTT GCTTTCATCA AGAAGTACAT CGACCTGATG
GCCCAGTATA AACTGAACAA TTTTCACTGG CACTTAACTG AAGACCAGGG CTGGAGGATT
GAGATTAAAA AATACCCTAA ATTAACCGCT ACCGCTGCCA GCAGAAACGG AACGATTATC
GGGCACTACC CAGGTGTAAA TAACGACGGA GAAGTTTATA AAGGTTTTTA TACGCAAAAT
GAAGTAAAGG AAGTTGTAGC CTATGCTATG GCGAGGTTCA TCAACGTAAT CCCGGAAATT
GAAATGCCGG GACATGCGAG TGCGGCAATT GCGGCTTACC CTGAGCTGAG CTGTTTCCCG
GATAAAGACA CCTTTGTGGC CGACGTTACA CCCTGGGCAG GTTCGCGTAA AGGTAAGCAG
GTACAGCAAA CCTGGGGCGT ATTCGACGAC GTATTTGTAC CCTCGGAAAA TACCTTTAAG
TTTTTGGAAG ATGTTTTAGA TGAGGTCATT GCGCTTTTCC CTTCTAAATA CATCCACATC
GGTGGGGATG AATCTCCTAA GAAATACTGG GAGCAAAGTG AGTTTTGCCA GAAATTAATC
AAACAGCTGG GCTTAAAAGA TGAGCACGAA CTGCAGAGTT ATTTCATCCA GCGCATTGAA
AAATATGTAA ATTCTAAAAG CAGGAGCATC ATAGGCTGGG ATGAAATTTT AGAAGGAGGG
TTGGCACCCA ATGCAACTGT AATGTCCTGG AGAGGTGTAA AAGGGGGGAT TGCAGCAGCG
CAGCAAAAGC ATGAAGTGAT CATGACCCCA AATGCAGGTG GTCTGTATTT TGACCACAAG
CAATCTGAAT CTGCCGATGA ACCCACCAAT ATCGGTGGTC TGGCACCTTA TTCCAAATCA
TACAACTATG ATCCGGTACC TGCCGAACTT GCACCTGATG AACAGAAATA TGTGATAGGG
GTGCAGGCAA ATGTCTGGAC AGAGTACATT CAAAGCGCTG CCAAAGTGGA GTACTTTTTG
CTGCCCAGAC TGTTCTCTTT ATCGGAGATT GCGTGGAGCC AGGCTTCAGG TAAAGACTTT
AAGAATTTTT CGGAAGAGCG CTTGCCTTTG CACCTCTCGA GGCTGGATAA AACAGGTACC
AATTACTGGG TACCCACACC CTTAGGCCTG AATCAGAAAG TGCTGAACGG TGAAGATTTC
AGCATTACCT TGAAAGAACC CATCCCGGGA GCTAAGATTT ATTATACCTT AGACCTGACC
CGCCCTTCAG AAATAGGGGA ACTGTATACC AAACCCATTA AAGTAAAAGT ACCGAAAGGC
CAGAAACAGA TTTTGAAAAC CATTGTTGTT ACAGCAGCTG GTAAACGGAG TGTGGTAACT
GAAACTATTT TAAACAACGG AAGTGCTGAA GTGGCCGCTA AATAG
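
The GC content field can in principle be recomputed from the gene sequence. The sketch below shows one way to do that, assuming GC content is defined as the percentage of G and C bases over the full sequence length. The function name `gc_percent` is hypothetical, and only the first line of the sequence is used here as sample input.

```python
def gc_percent(sequence: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    # Remove the spaces and line breaks used to group the sequence in blocks of ten.
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First line of the gene sequence above, used as sample input.
first_line = "ATGATGAAAA AACTATTAGC TATTGGTTTA CTGGCCCTTT TGTATGTTCC TGTTTTTGCC"
print(round(gc_percent(first_line), 1))
```

Passing the complete gene sequence instead of one line would give the value to compare against the listed GC content.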
 
Protein sequence
MMKKLLAIGL LALLYVPVFA QNDVNMGIIP APVSVKKGAG TFKLDKTVVL ISNEVKNSKS 
ADLLNAFIVS KAGFSLREAK SAMANERAIV LSSAAAEQLP AEGYKISINP KTITITGKAA
GLFYGVQSVM QLMPDKQHNE ILIPAAEIND YPRFKYRGLH LDVGRHVFPV AFIKKYIDLM
AQYKLNNFHW HLTEDQGWRI EIKKYPKLTA TAASRNGTII GHYPGVNNDG EVYKGFYTQN
EVKEVVAYAM ARFINVIPEI EMPGHASAAI AAYPELSCFP DKDTFVADVT PWAGSRKGKQ
VQQTWGVFDD VFVPSENTFK FLEDVLDEVI ALFPSKYIHI GGDESPKKYW EQSEFCQKLI
KQLGLKDEHE LQSYFIQRIE KYVNSKSRSI IGWDEILEGG LAPNATVMSW RGVKGGIAAA
QQKHEVIMTP NAGGLYFDHK QSESADEPTN IGGLAPYSKS YNYDPVPAEL APDEQKYVIG
VQANVWTEYI QSAAKVEYFL LPRLFSLSEI AWSQASGKDF KNFSEERLPL HLSRLDKTGT
NYWVPTPLGL NQKVLNGEDF SITLKEPIPG AKIYYTLDLT RPSEIGELYT KPIKVKVPKG
QKQILKTIVV TAAGKRSVVT ETILNNGSAE VAAK
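
The protein sequence corresponds to the gene sequence read in codons under translation table 11. The following is a minimal sketch of that translation, not the method used to produce this record. It builds the codon table from the standard amino acid assignments, which table 11 shares with the standard code, and it does not handle the alternative start codons of table 11 (this gene begins with ATG, so that does not matter here). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Amino acids for the 64 codons in T, C, A, G order (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a DNA sequence codon by codon, stopping at the first stop codon."""
    bases = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 and last 15 bases of the gene sequence above.
print(translate("ATGATGAAAA AACTATTAGC TATTGGTTTA"))  # MMKKLLAIGL
print(translate("GTGGCCGCTA AATAG"))                  # VAAK
```

The two outputs match the first ten and last four residues of the protein sequence.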