Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Phep_1358 |
Symbol | |
ID | 8252458 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | - |
Start bp | 1614874 |
End bp | 1616814 |
Gene Length | 1941 bp |
Protein Length | 646 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 644935012 |
Product | NHL repeat containing protein |
Protein accession | YP_003091635 |
Protein GI | 255531263 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG4632] Exopolysaccharide biosynthesis protein related to N-acetylglucosamine-1-phosphodiester alpha-N-acetylglucosaminidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 0.967304 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAATATT TTAATCTTAC CATAAGCAGA ACAGCACTGG TTTTTCTTAT TGTTTCTGTC ATACATTTCA GTTGTACAAA ACAGATTGAT CAGGTCCTGT TAAAAGAGAA ACCGATTCGC AGTGGATCGG CAGCTGTAAC TACACTGTCC TACTCCAGCC TGACACAAAA ACTGATAGAT GAAACGAGTC TTATCGGCAC AGTGATTTCA GATGATGAAA TCACTGTTGC GCCAGGTGTA ACAGAAACAG ATATACATTA TACCGATACC GCCGGCAAAG CCATGCACCT GTTTATTCTG AAAGTTAACC TCAACGAACC CCAGGTATTC ATGGAAGTAG CTACACCTTT TAATCTTCCG GCATATGCCA GGCAAACCGT ACCTGCACAG GCGGCCGAAA TTGATACCGC TACCCATATG GTGATAGCAG GCATAAACGG CGATTTTTTT GATACCAGCA CCGGTATTCC TATGGGTATT GTACATAAAA ATGGTAGCAT CGTCAAAAGC ACTTTTAATG ACAATACCCT GAAACCCCAG CAGGCAGTCA GTTTCTTTGG CGTAACAGAA AATAATGTTC CGATTATCGA TTTTAAAAGT GGTTATGCCG CCCTCAGCAG CCAGCTTTAC AACAGTACTG GCAGCGGAGT AATGCTGGTA AACAATCATC TTCCTGTTTC ACAACCTTAC ACGGCAATCG ATCCCAGAAC ATCAGTTGGA TATGATGACA ACGGCATCGT ATATTTTGTA GTAATAGATG GACGAGACGC CCCCTATTCC AATGGAATGA ACTATGCACA GCTTACCAGT GCTTTTATGG CTTTTAATGT AAAAAATGCT GTAAACCTGG ACGGCGGCGG CTCCTCTACT TTTATGACCA GAAACCCGGT AACCAATTTA TTACAGGTAA GGAATCAGCC TTCAGACGGT ACCGCACGTG CCGTTGCCAA TGCCTGGCTG GTCTATATCA GTAAGGTGCT GGTCAGCAAT TACGCTGGTA CAGGAACTGC CGGCTTAGTG AACGGAGCTA AAGCCAGTGC CCGTTTCGAT AGCCCCGAAG GTCTTGCTAT TGATGCATCA GGTAATATGT ACATTGCAGA TAAAAACAAC AATGTGATCC GGAAAATCAC TTCTACCGGA ACAGTGAGTA CCTTTGCAGG TACCGGAGTG GCGGGCTTTG CAGATGGGGC CGGCAGTATA GCTAAATTTA ACGGACCATG GAAAGTTGCT GTCGATGCAA CAGGCAATGT ATACGTCGCA GACAGGGACA ACTTTAAGAT CAGGAAGATC ACTCCGGCAG GTATCGTAAG CACACTTGCC GGAAGTACAG CCGGTTATGC AGATGGAACA GGGAGTGCCG CTAAGTTTAT GCAACCGCTT GATGTGGCCA TTGACCCCTC TGGCAACGTA ATTGTGGCCG ACAATACCAG CCACCGCATC CGCAAAATAA CAGCAGCAGG TGTGGTAACT ACAATTGCCG GAAACGGAAC AGCAGGTTAT ACCAATGGAA CAGGCACAGC TGCACAATTT AAAAACCCAT CAGGTGTAGA TGTCGACGCA TCCGGAAATA TTTATGTTGC CGATCGTTTA AACCATCGGA TCAGAAAGAT CACCACATCG GGAGTAGTCA GTTCCTTAGC TGGCACCGGA ACTTCGGGCA CTACAGATGG CGCGGCTGGT TCAGCCAAAT TTTCAGACCC TTATGGTGTT ACTGTCGATG TATCCGGAAA TGTGTATGTA GCAGACCTGA TCAGTTCAAG GATCAGAAAA ATATCATCCG GCCAGGTTAG CACCTTAGCC GGAACTATAC CTGGTTATCA AAATGGAACA AGCACAATAG CCAAATTCAA TCAGCCTACC GATCTGGTTA TCCAGGGGTC GAACATCTAT ATAGCAGACC ATTCCAACAA CAGCATCCGT CTGGTCAAAT TAATCAATTA A
|
Protein sequence | MKYFNLTISR TALVFLIVSV IHFSCTKQID QVLLKEKPIR SGSAAVTTLS YSSLTQKLID ETSLIGTVIS DDEITVAPGV TETDIHYTDT AGKAMHLFIL KVNLNEPQVF MEVATPFNLP AYARQTVPAQ AAEIDTATHM VIAGINGDFF DTSTGIPMGI VHKNGSIVKS TFNDNTLKPQ QAVSFFGVTE NNVPIIDFKS GYAALSSQLY NSTGSGVMLV NNHLPVSQPY TAIDPRTSVG YDDNGIVYFV VIDGRDAPYS NGMNYAQLTS AFMAFNVKNA VNLDGGGSST FMTRNPVTNL LQVRNQPSDG TARAVANAWL VYISKVLVSN YAGTGTAGLV NGAKASARFD SPEGLAIDAS GNMYIADKNN NVIRKITSTG TVSTFAGTGV AGFADGAGSI AKFNGPWKVA VDATGNVYVA DRDNFKIRKI TPAGIVSTLA GSTAGYADGT GSAAKFMQPL DVAIDPSGNV IVADNTSHRI RKITAAGVVT TIAGNGTAGY TNGTGTAAQF KNPSGVDVDA SGNIYVADRL NHRIRKITTS GVVSSLAGTG TSGTTDGAAG SAKFSDPYGV TVDVSGNVYV ADLISSRIRK ISSGQVSTLA GTIPGYQNGT STIAKFNQPT DLVIQGSNIY IADHSNNSIR LVKLIN
|
| |