Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Phep_3535 |
Symbol | |
ID | 8254656 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | + |
Start bp | 4203170 |
End bp | 4204804 |
Gene Length | 1635 bp |
Protein Length | 544 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 644937186 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_003093788 |
Protein GI | 255533416 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.269906 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCCGTA TTCTAAGCAA TATTTTTTCT TTGCTGCTCA TTTTAACTTT TGCAGCATGT AAAAACCAAT CTTCCAATGC AGAAAAAAAG GTTTTTAGCA TGAACCTTGA TCAAAATGTG ACATCCCTTG ATCCCGCTTT TGCGCGCAAC CAGAATGCCA TCTGGATGAT CAACCAGATT TTTAACGGAC TGGTACAGAT TGACAGTTCC TTAAATACCT TGCCCTGTAT TGCTAAAACA TGGGAAATTT CGAAAGACGG CTTAACCTAC ACCTTTCACC TCCGCAATGA TGTTTATTTT CATGACGACC CGATTTTTCC GGGTGGCAAA GGCCGTAAAG CTGTTGCAGC AGATTTCAGA TACAGCTTTT ACAGGCTTAT AGACCCAAAA GTGGCTTCAT CCGGGGGATG GATCTTCAGC GATAAGGTAA AAGACATCAA TAGCTTTATT GCGGTAAATG ACACGACTTT CCAGGTAAAA CTCATGAAGC CTTTTCCTGC TTTTATGAAC CTGCTCACTA CGCAGTACTG CTCAGTGGTT CCGAAAGAAA TTGTAGAGCA TTATGGTAAG GATTTCCGTA GTCACCCTAT AGGCACGGGT CCGTTTAAAT TCAAGTACTG GAAAGAGGAT GAGATACTGG TGATGTTAAA AAACGAACAT TACTGGGAAA AAGAAGGCGA CCGGCAATTG CCCTACCTGG ATGCGGTAAA AGTTACTTTT ATTAGCGACA AGCAGAGTGC CTTCATGAAT TTCATTAAAA AGGACCTGGA TTATTTTGAT AAGGTTGATG GCAGCTACCG TGACGATATT TTAACCAAAA GCGGAAAAAT GACCAGTAAG TACAAAGGCA AATTCAAGCT CAGAAAAGGC CCTTATCTGT GCACAGAATA CGTAGGTATC CTGGTAGACA CCTCGAAGGC CATTGCTAAA AATTCGCCCT TAAAATATAA AAAGGTAAGA CAGGCCATTA ATTATGCGAT TGATAAACCC AAGCTGATCA AATATTTAAG GAACAGCATC GGCATGCCGG CCACCTCTGG ATTTATCCCG CATGGCATGC CTGGTTTTGA CAGTACAGCG GTTAAAGGTT ACCATTACGA ACCGCAAAAA GCAGCCCGCC TACTGGCCGA AGCTGGTTTC CCGAACGGAA AGGGTATGCC GCAGATTACC CTCAGCACTT CTACCACTTA TAAAGACCTG ATCGAATTTA TTCAGGGGGA GCTAAGTGCC ATAGGCATCA ATGTAAAGGT TGATGTAAGC CCAAGTGCCA GTCTGCGCGA CCTGATCTCA AAAAATGGGG TAAACTTTTT CAGGGGCTCC TGGATTGCCG ATTATCCGGA TGGCGAGAAC TACCTGGCCA TGTTCTATTC CAGGAACAAA GTACCGAACG GCCCCAATTA TACAGGTTAT TTTAACGATG AGTTTGACCG TCTTTTTGAA CAGAGCTATT ACGAAAGCGA CAACCAAAAG CGTTACCTGC TGTATCAGAA AATGGATAGA ATGATCGTAG AATATGCCAA CGTAGTGCCG ATACTCTACG ATCAATCGCT TGTGATGACA CAGAACAATA TCAGTGGCCT GCAGGTGAAT GCGTTAAACC TGATGATATT GAAGACTGTC AGGAAGGAAA ATTAA
|
Protein sequence | MRRILSNIFS LLLILTFAAC KNQSSNAEKK VFSMNLDQNV TSLDPAFARN QNAIWMINQI FNGLVQIDSS LNTLPCIAKT WEISKDGLTY TFHLRNDVYF HDDPIFPGGK GRKAVAADFR YSFYRLIDPK VASSGGWIFS DKVKDINSFI AVNDTTFQVK LMKPFPAFMN LLTTQYCSVV PKEIVEHYGK DFRSHPIGTG PFKFKYWKED EILVMLKNEH YWEKEGDRQL PYLDAVKVTF ISDKQSAFMN FIKKDLDYFD KVDGSYRDDI LTKSGKMTSK YKGKFKLRKG PYLCTEYVGI LVDTSKAIAK NSPLKYKKVR QAINYAIDKP KLIKYLRNSI GMPATSGFIP HGMPGFDSTA VKGYHYEPQK AARLLAEAGF PNGKGMPQIT LSTSTTYKDL IEFIQGELSA IGINVKVDVS PSASLRDLIS KNGVNFFRGS WIADYPDGEN YLAMFYSRNK VPNGPNYTGY FNDEFDRLFE QSYYESDNQK RYLLYQKMDR MIVEYANVVP ILYDQSLVMT QNNISGLQVN ALNLMILKTV RKEN
|
| |