Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Phep_2756 |
Symbol | |
ID | 8253864 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | - |
Start bp | 3257735 |
End bp | 3258895 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 644936404 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003093019 |
Protein GI | 255532647 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.196216 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTGACA AAATTAGTTT AAAAGGAATA ACATGGAACC ACAGCCGTGG CTTTACCCCA ATGGTAGCTA CAGCACAGCG TTTTTCAGAG TTGAATCCAT CAGTAGAAAT TATATGGGAG AAAAGAAGTT TACAGGCTTT TGCTGATTTT TCCATACAGG AGCTGGCCGA AAGATTTGAT CTTCTGGTAA TTGATCATCC CTGGGCAGGT TTTGCCGCCA AAACAGGATC CATTCTTCCG CTGGACAAAT ATTTTACAAA GGATTATTTA AAAGACCAGG AAGACAATAC AGTAGGACAT TCTTATGAGA GTTACAGTTA CGACGGACAC CAGTGGGCAC TCCCGATAGA TGCGGCGACA CCCGTAGCGG CCTGCCGACC GGATTTACTG GAACAGCATG GCCTTGCTTT GCCAAAAACA TTTGAGGATT TACTGAACCT GGCCGATAAA GGCCTGGTGG CTTTTGCGGG GATCCCCATT GATGTACTCA TGAATTTTTA TACATTTTGC TGCTCGCTGG GCGAAGATCC CTGCCAGCAG GAGGACCTGA TCATATCCAG TGATATTGGT ATTGCCGCAC TCAGGATGTA CAGAGAACTG GCTTCAAAGA TTCATCCCGA TAATTTTAAA AGAAATCCAA TACAGACATA TGAAGCCATG ACCTTAGGTG ATGATATTGC TTATTGTCCT TTTGCTTATG GGTATTCCAA TTATTCCAGA AAAGGTTACG CCCGCAAACT GCTGCATTTT CATGACATGA TCTCTTTGAA TGGGCGCAGT AACCTGCGCA GTACGCTTGG AGGAACCGGC CTGGCCGTAT CTTCAGCTTG TAAGCATATA GAAATGGCTG TAAAATATGC AGGATATGTA GCTTCGCCAT TCTGGCAACA GGGGCTTTTC TTCGAAAATG GTGGTCAGCC CGGCCACTTA AGTGCCTGGA CAGATGCAGA AGTGAACCAC AGGTCGAATG ATTTTTTTGT GAATACGCTG CCAGCACTGC AACGGGCTTT CTTGCGACCG CGTTACCACG GGCACATGTT TTTCCAGGAC CATGCCGGGG ATATCGTGCG TGATTACCTC ATGATGGGAG GATCTGAAAT ATCAGTACTG GAAAAGCTGA ATAGTTTATA TGTAGAATCC AGAACCCTGC AGTTGTCATG A
|
Protein sequence | MSDKISLKGI TWNHSRGFTP MVATAQRFSE LNPSVEIIWE KRSLQAFADF SIQELAERFD LLVIDHPWAG FAAKTGSILP LDKYFTKDYL KDQEDNTVGH SYESYSYDGH QWALPIDAAT PVAACRPDLL EQHGLALPKT FEDLLNLADK GLVAFAGIPI DVLMNFYTFC CSLGEDPCQQ EDLIISSDIG IAALRMYREL ASKIHPDNFK RNPIQTYEAM TLGDDIAYCP FAYGYSNYSR KGYARKLLHF HDMISLNGRS NLRSTLGGTG LAVSSACKHI EMAVKYAGYV ASPFWQQGLF FENGGQPGHL SAWTDAEVNH RSNDFFVNTL PALQRAFLRP RYHGHMFFQD HAGDIVRDYL MMGGSEISVL EKLNSLYVES RTLQLS
|
| |