Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Phep_0002 |
Symbol | |
ID | 8251086 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | + |
Start bp | 1284 |
End bp | 2153 |
Gene Length | 870 bp |
Protein Length | 289 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 644933651 |
Product | hypothetical protein |
Protein accession | YP_003090290 |
Protein GI | 255529918 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG4632] Exopolysaccharide biosynthesis protein related to N-acetylglucosamine-1-phosphodiester alpha-N-acetylglucosaminidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 46 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTTAAAA AATCTTTCCT TACTCTTGTT GTGCTTTGTG GCTACGGTTT AGCTTTTGCA CAAAATACAG ACTCACTTAA CCTGGTCCAT ACCAAATGGG ATAAACAGCG CCTGGCCCGG AAAATCAAAC TGGTTACCCA CCATTTTAAT GCTAAAGATC TTTTTCTGGC CAACCAGAAC ATCAGCTACC TGGAAATTAA AAACAAAGGC AGATCGCCTG TACTGGCCAT CTCCGCAGAA GAAAAAGTGT TAAAAACGAC CAGTACCTTT GGCACAGAAA ACAATGCGCT GGCCGCTGTT AACGGTTCTT TTTTTGATGT TAAGAATGGC GGGTCGGTAG ATTTTATAAA AGTGGGGGGC AAAGTGCTGG CCGAAAACCG GCTCGAAAAA AATGATAGCC GCGCAAGGCA CCAGCAGGCA GCTGTAGTGA TCAGCAATGG TAAACTGGCC TTAAAAAAAT GGGACGGTAC TGCCGACTGG GAGCAACGCT TAACAGAAGA AAATGTTTTG CTGAGCGGCC CCCTGCTGAT GTTAAACGGT ACGGACGAGG CGCTCGACTC TACCAGTTTT TCACGGTCAC GGCATCCGAG AACTGCCATC GGCATTAAGC CGAACGGAAG AATCCTTTTA CTGACGGTTG ACGGCAGGAA CAGCAACTCG GCAGGAATGA GTTTAACAGA ACTGGCCAAA ACGATGAAAT GGCTGGGCTG TACCAGCTCC ATTAACCTGG ATGGCGGGGG CTCTACAACC TTATGGGTAA GTGGTTTTCC CGGGGGTGGA GTGGTGAATT ACCCAACGGA CAATAAACTT TGGGACCATG CCGGACAACG GAAAGTAGCA AATGTGATCC TGGTGAAAAA GACGCGCTGA
|
Protein sequence | MLKKSFLTLV VLCGYGLAFA QNTDSLNLVH TKWDKQRLAR KIKLVTHHFN AKDLFLANQN ISYLEIKNKG RSPVLAISAE EKVLKTTSTF GTENNALAAV NGSFFDVKNG GSVDFIKVGG KVLAENRLEK NDSRARHQQA AVVISNGKLA LKKWDGTADW EQRLTEENVL LSGPLLMLNG TDEALDSTSF SRSRHPRTAI GIKPNGRILL LTVDGRNSNS AGMSLTELAK TMKWLGCTSS INLDGGGSTT LWVSGFPGGG VVNYPTDNKL WDHAGQRKVA NVILVKKTR
|
| |