Gene Information |
Locus tag | Phep_4104 |
Symbol | |
ID | 8255238 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | - |
Start bp | 4950713 |
End bp | 4952929 |
Gene Length | 2217 bp |
Protein Length | 738 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 644937768 |
Product | glycoside hydrolase family 3 domain protein |
Protein accession | YP_003094357 |
Protein GI | 255533985 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1472] Beta-glucosidase-related glycosidases |
TIGRFAM ID | |
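
The length fields in this table follow from the coordinates. A minimal sketch of that arithmetic, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon (the function name `cds_lengths` is hypothetical, not part of any database API):

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and exactly one terminal stop codon.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # The stop codon is counted in the gene length but encodes no residue.
    protein_length = gene_length // 3 - 1
    return gene_length, protein_length


# Coordinates copied from the table above (Phep_4104 on NC_013061).
print(cds_lengths(4950713, 4952929))
```

With the values listed for Phep_4104 this gives 2217 bp and 738 aa, which matches the Gene Length and Protein Length rows.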
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.28211 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 41 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGCTAT TTCAATTTAT ACTATTCTTC TTTACAGGTC TGCAGCTTTT AAAAGTACAG GCCCAGGAAA AAACCGGTTA TATTCCCTTA ACCAAACAAG AAAGGCAAAA AGTAGAACTC CTGCTGAGTA AAATGACCTT GGAAGAAAAA GCGCATCAGC TGGCCTCATT TTATCCCAAT GCCAATAAAA GATTAAATAT CCCCCATATG CAGGCCGGTG AATGTCTGCA TGGTGTGGTT GCTGCCGGCA CCACTTCTTT CCCTCAGGCC ATTTCCATAG CCAGTTCCTG GGATCCTTCG CTTGTCGAAA GGGTATCTAC CGTGATTGCA AAAGAAGCCA GGGCTTTAGG CATACACCAC TGTTATACCC CAATGCTTGG GGTTTTGCGC GATGCGCGCT GGGGCCGTTT CGAAGAAGGT TATGGAGAAG ATGCCTACCT GGTCAGTAAA ATCGGCGTAG CCTTTATCAA TGGCCTGCAG GGCCGTGGCA AAAACCGCTT CGATAAGGAC CATGTAGTGG CAACAGCTAA ACATTTTGTG GCCGATAGTG AACCCCTGCT GGGTGCCAAT GGTGCTGCAG TCGAAATTTC CCTGCGTAGT TTGCACGAAG TTCACCTTCC GCCTTTCCGG GCTGCAGTAG AAGAAGCTCA GGTTGGTTCG GTCATGCCTG CACATCATAC CTTAAATGGG GTGCCCTGTC ACATCAATAC CTATACCCTA AACGATGTAT TCAGAAAGGA ATACGGCTTT GATGGTCTGG TGGTTTCTGA TAACAACGAC CTGAGGTGGG TTCAGGAGCG CTTGTTCGCC ACCGAAAGCC AGGAAGAAAC CATCAGAAAA GCACTGGAAG CAGGTGTGCA TACCGAGCTT GCCTTTAAAC AGACCTGGGC CGATAAAAGA ATGTATGGCC CCCCACTGGT CGCCGCGGTA AAAAACGGAA AAGTGCCGGT AAAACTGCTC GACGACGCCG TTAGAAAAGT ACTGGAATTT AAGATTGCCC TGCACCTCGA CGAAGAAGAA AATCCATTGG GCAAGGAAAT GACCGAATTA CAAAAAGGTA CAAAAGATGC AGATGTAAAT GCTGATGTAT TCTTTTCGCA GATCGATGGC TCATTGTCCA GCCCCAGATC AAACTATAAA ACCGTACTAA ATAATCCTGT ACACGATGCA CTTGCACTCG AAGCAGCCCG CAAAAGTCTC ATCCTCCTAA AAAACAACAA CCTGCTGCCA TTTAAAAAAA GTCAGTTCAA AAAGATAGCC GTAATTGGTC CAAATGCCGA TACCATTCGC CTGGGCACTT ATTCTACCCA GCAGCCTAAA CACTTCATTA CTGTAAAACA AGGCATCGAA ACTGCTGTAG GTAAAAATGC ACAGGTATTG TATGCGAAAG GGACTGATAT CCAGCATCCA AAAGATACGC AGCTTGCAGA AGCCGTTGCC ATTGCAAAAG AAGCTGATGT ATGTATCCTG GTGCTGGGCG ATGATGATAA AACCGTAATG GAAAATGTGG ATAGGGACGA CATTACCTTG CCGGGCGACC AGGATAAGCT GATGCAGGCC ATTGTAGCCA CAGGCAAACC TGTAGTACTG GTATTGCTGC ATGGCCGTCC GGCCGCTATT CAATGGGCCA AAGACCATGT TCCGGCCATA TTAGACGGAT GGTTTCTGGG GCAGGAAACA GGTACTGCCA TTGCAGAAGC CATATTTGGC GATCTGAACC CTTCCGGAAA ATTAACTGTT ACCTACCCAA GAAATGTAGG TCAGGTACCT GCATTTTATA ATACTTTAAT ACCAGGCAGG 
CCAAGAATGA TGTGGGGAAC TACAGAAGGT GCAACCTATC CCTTTGGTTA TGGCATCAGC TACACACAAT TTAAATATGG AGTACCAAAA CTCTCTAAAG CCAGCATGAA AGCCAGTGAA ACTGTTTTTG CCGAAATCGA AGTAACCAAT ACCGGTAAAG TGGCTGGCGA TGAAATTGTG CAGCTGTACC TTCGTGATGA CATCTCTTCA CTGGCAAGGC CAATTAAAGA ATTAAAAGGG TTTAAACGCA TTAGCCTGCG TCCGGGCGAA ACCCAAAAGA TTTCCCTGCC CATTTCTTCC CGTTCGCTTG AATTCTGGAA AGATGGCAAA TGGATTACCG AACCTGGCAG TTTCACAGTC ATGATGGGCC CAAATTCTGA AGAACTGAAA ACCATTAAAT TAGAACTGAC CCAATAA
|
Protein sequence | MRLFQFILFF FTGLQLLKVQ AQEKTGYIPL TKQERQKVEL LLSKMTLEEK AHQLASFYPN ANKRLNIPHM QAGECLHGVV AAGTTSFPQA ISIASSWDPS LVERVSTVIA KEARALGIHH CYTPMLGVLR DARWGRFEEG YGEDAYLVSK IGVAFINGLQ GRGKNRFDKD HVVATAKHFV ADSEPLLGAN GAAVEISLRS LHEVHLPPFR AAVEEAQVGS VMPAHHTLNG VPCHINTYTL NDVFRKEYGF DGLVVSDNND LRWVQERLFA TESQEETIRK ALEAGVHTEL AFKQTWADKR MYGPPLVAAV KNGKVPVKLL DDAVRKVLEF KIALHLDEEE NPLGKEMTEL QKGTKDADVN ADVFFSQIDG SLSSPRSNYK TVLNNPVHDA LALEAARKSL ILLKNNNLLP FKKSQFKKIA VIGPNADTIR LGTYSTQQPK HFITVKQGIE TAVGKNAQVL YAKGTDIQHP KDTQLAEAVA IAKEADVCIL VLGDDDKTVM ENVDRDDITL PGDQDKLMQA IVATGKPVVL VLLHGRPAAI QWAKDHVPAI LDGWFLGQET GTAIAEAIFG DLNPSGKLTV TYPRNVGQVP AFYNTLIPGR PRMMWGTTEG ATYPFGYGIS YTQFKYGVPK LSKASMKASE TVFAEIEVTN TGKVAGDEIV QLYLRDDISS LARPIKELKG FKRISLRPGE TQKISLPISS RSLEFWKDGK WITEPGSFTV MMGPNSEELK TIKLELTQ
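
The sequences above are displayed in space-separated blocks of ten characters, so they need light cleaning before use. The sketch below strips the whitespace, computes a GC fraction, and translates a nucleotide string. It assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard code for internal codons, and it does not handle alternative start codons. The helper names (`clean`, `gc_fraction`, `translate`) are illustrative only.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (first base varies slowest).
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First two blocks of the gene sequence shown above.
fragment = "ATGAGGCTAT TTCAATTTAT"
print(translate(fragment))
print(gc_fraction("ATGAGGCTAT"))
```

Run on the full gene sequence, `translate` can be compared against the Protein sequence row, and `gc_fraction` against the GC content row.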