Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Phep_1201 |
Symbol | |
ID | 8252299 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | - |
Start bp | 1421756 |
End bp | 1423009 |
Gene Length | 1254 bp |
Protein Length | 417 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 644934856 |
Product | Chorismate binding |
Protein accession | YP_003091481 |
Protein GI | 255531109 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0147] Anthranilate/para-aminobenzoate synthases component I |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.798277 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.539184 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCTGC AGGATTTATA TTCATTTAAG CAAAAAGCGC TGACCTGGGC CAGTTCCTTT GAAGTATGCT GTGTGCTCGA TTCCAACGAC TATACCGACC CTTACAGTGC ATTTGATCTG ATACTGGCAG TAGGGGTAAA AGAAGAGTTA AATGCGCGGT CTGAAGATAA GTTTACGGAA CTGAAAGCAT TTTACGAACA ACACAAAAGC TGGATGTTTG GCTTATTGAG TTACGACCTG AAAAATGAGG TAGAACAGCT CAGCTCTTGC AATGCTGATC AGCTTGGCTT CCCCGACCTG TTTTTCTTTG TACCACGTTA CCTGCTGAGG GTTAAAGACG GAAAAGCAGA AGTACTGGTT GGCGAGCCAG CAATTCTTGA TACCATTGAC CGTATATCCC CACTTATTGC CCCCTATAAT AAAGATATTT CTATACAAAG CCGGATTTCG AAAGAACTGT ACCTAAAAAC CGTTGAAACG TTACAGCAAC ACATTGCACG CGGCGACATT TATGAAATTA ATTTTTGTCA GGAATTCTAC GCCTTGCACG CTGCCGTAGA TCCTGTAGCC ATTTACTGCA AACTGTCAGA AGTTTCACCC ACCCCGTTTT CGGGCTTCTT TAAAATAAAA GACCAGTATA TCCTCTCTGC CAGTCCGGAA AGGTTCCTTT GTAAAAGGGC CCGTCAGCTC ATTTCGCAGC CCATTAAAGG TACAGCAAAG CGAAGCCCGG ATAAGGAAGA AGATGCCCGC ATTAAAAATA AACTGCGTAA AAATATAAAA GAACAGGCAG AAAATGTGAT GATTGTAGAC CTGGTGCGCA ACGACCTGAC TAAAAGTGCC ATAAAAGGAA GTGTAAAAGT AGATGAACTT TTTGGCATCT ACGGCTTTCC GCAGGTTTAC CAGATGATCT CTACCATTAG CAGTAAACTG GACCCTGCTG TCCATTTTGT TGATGCCATT AAACAGGCTT TCCCTATGGG CTCTATGACA GGGGCACCAA AGGTTAAAGC CATGGAACTC ATAGAAACCT ATGAATCCAG CAAGCGGGGT GCCTATTCAG GAGCAATGGG CTATATCAAT CCCCAGGGTG ATTTCGATTT TAATGTCATC ATACGCAGTA TATTGTATCA TGCTGATACA CGCTACCTGT CCTTTCAGGT AGGCGGGGCT ATTACCTTTG CATCCAATGC AGCGGATGAA TATGAAGAAT GCCTGCTCAA GGCATCAGCA ATTATCCAAA CCCTTAAAAC ATAA
|
Protein sequence | MSLQDLYSFK QKALTWASSF EVCCVLDSND YTDPYSAFDL ILAVGVKEEL NARSEDKFTE LKAFYEQHKS WMFGLLSYDL KNEVEQLSSC NADQLGFPDL FFFVPRYLLR VKDGKAEVLV GEPAILDTID RISPLIAPYN KDISIQSRIS KELYLKTVET LQQHIARGDI YEINFCQEFY ALHAAVDPVA IYCKLSEVSP TPFSGFFKIK DQYILSASPE RFLCKRARQL ISQPIKGTAK RSPDKEEDAR IKNKLRKNIK EQAENVMIVD LVRNDLTKSA IKGSVKVDEL FGIYGFPQVY QMISTISSKL DPAVHFVDAI KQAFPMGSMT GAPKVKAMEL IETYESSKRG AYSGAMGYIN PQGDFDFNVI IRSILYHADT RYLSFQVGGA ITFASNAADE YEECLLKASA IIQTLKT
|
| |