Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Phep_4047 |
Symbol | |
ID | 8255181 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | + |
Start bp | 4890369 |
End bp | 4891637 |
Gene Length | 1269 bp |
Protein Length | 422 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 644937711 |
Product | glycosyl transferase group 1 |
Protein accession | YP_003094300 |
Protein GI | 255533928 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGATAA CCTTGATCAA TACATCCGAT GCAGGCGGGG GTGCTCCCGT AGCCAGTATG CGCCTGTTAA AAGCGCTGGC ATCAGAAAAC ATCGAGGTAA ACTTCCTGGC CGGCGATAAA AAGACCAGCG AGGCCCGCGT TCAGGCCGTT CAAAATAACT TATGGCAGCG GTTAAAGGCC CGTTTTAATT TTTTATATGA ACGCCTGCCC TTTATCCTTT TTTATGAAAA GGACAAATCT GTACGCTTCG CCTTTTCAAC GGCCAATGCC GGAACCAGCA TTGCCGGTCA TCAGCTGGTT AAAGAGGCCG ACATTTTACA CCTGCACTGG ACCAACTCCG GCTTTCTTTC TATCCGAGAT TTAACAGAAC TGATGAAACT GGGCAAACCG GTAGTGTGGA CCCTCCACGA TATGTGGGCA TTTACAGGGG GCTGCCATTA TGCGGGCACC TGCGACCATT TTACAGGGGA ATGCGGCGAT TGTTACTTTT TAAGAGACCC TGCCGGGAAC GACATTTCCC ATACCGGCTG GCTGCACAAG CAGGCCATGT ATGCCAGCAG CCCAAACCTC ACTTTTGTAA GCTGCAGCAA CTGGCTGGGC AAGGTGGCCC GTCAAAGTTC ATTAGTAAAG GATTTTCCTG TCCGGGCCAT TGCCAACCCC ATCGATATAG AGGTATATGC CCCTAAAGCC AGGATGGCTG CCCGTACCAA ATGGAACATC AGCAACAACA GCAGGATCAT TTTATTTGGT GCAGCCAATA TAGGCGACCG CCGTAAAGGG ATTGGCTATC TGGTACAGGC ACTGCAGCAT TTAAAGCAAC ATTACGAATT GCCCCTTCCG GTTGAGGTAC TGATTTTCGG GAAAAACAAA CACTTTGATG TCAGCCAGCT CCCCTTTCCT GTTCATGAGC TGAATACCAT TAGCGCAGCA CAGGACCTGG CCGAACTGTA CAGCCTGGCC GATGTATTTG TAATGCCTTC TGTTGAAGAC AACCTGCCCA ATACCGTTAT GGAATCTATG GCCTGCGGTA CCCCCGTAGT GGCTTTTGAT ACAGGTGGCC TGCCGGAAAT GATAGACCAC CAGTTAAATG GCTACCTGGC CAGTTTCAGG TCGGCCACAG ATCTGGCCAC AGGCATCCAT GAAATGCTCT TTACCCCCCG GGCAGAAGAA ATCGCTATAC AGGCCCGGAA CAAAGTGCTC CAAAATTTTA CCAATCAACA CATCGCCAGA CAATATACAG ACCTCTACCA ATCCCTACTT AGCAAATGA
|
Protein sequence | MKITLINTSD AGGGAPVASM RLLKALASEN IEVNFLAGDK KTSEARVQAV QNNLWQRLKA RFNFLYERLP FILFYEKDKS VRFAFSTANA GTSIAGHQLV KEADILHLHW TNSGFLSIRD LTELMKLGKP VVWTLHDMWA FTGGCHYAGT CDHFTGECGD CYFLRDPAGN DISHTGWLHK QAMYASSPNL TFVSCSNWLG KVARQSSLVK DFPVRAIANP IDIEVYAPKA RMAARTKWNI SNNSRIILFG AANIGDRRKG IGYLVQALQH LKQHYELPLP VEVLIFGKNK HFDVSQLPFP VHELNTISAA QDLAELYSLA DVFVMPSVED NLPNTVMESM ACGTPVVAFD TGGLPEMIDH QLNGYLASFR SATDLATGIH EMLFTPRAEE IAIQARNKVL QNFTNQHIAR QYTDLYQSLL SK
|
| |