Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Phep_3633 |
Symbol | |
ID | 8254764 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | + |
Start bp | 4343017 |
End bp | 4346049 |
Gene Length | 3033 bp |
Protein Length | 1010 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 644937294 |
Product | TonB-dependent receptor |
Protein accession | YP_003093886 |
Protein GI | 255533514 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAGAA TCTTTACAAA ACTTTCAGTT TTAACTTTTC TCTGTTTTCT TTTTACGAGC GTAACACAAG CACAGGACCT TACTGTAACT GGTGTAGTAA CAGATTTAGC AGATAAACTG CCGCTACCTG GTGTCAGTGT ACAGGTAAAA GGAACCCAGA AAGGAACAAC CACAGATGCA ATGGGCAAAT ATGCCATCAG CGCACCTGCA AATGCAACCC TGGTTTTCAC ATCCATTGGG TATACCAGCC GTGAAATGCA GATCGGCAAT CAAACCACAA TTAATGTCGT GCTTTCCTCT GCCTCCCAAG ACCTTGAAGG TGTAGTTGTA GTAGGTTATG GCACGCAAAG AAAACGTGAT CTAACAGGAG CAATTACACA GATCAAAGGC GACGAAGTGG CAAAAATGCC AAACACCAAT CCTCTATCTT CTTTACAGGG TAAAGTAGCC GGTTTAACCG TTGTAAACAG TGGCACACCC GGTGCCGCAC CTACCGTAAG GATCAGGGGG GTAAACAGCA CAACATCAGG AGGCAACAAT CCTCTTTATG TAGTTGACGG CGTGCATCAG GACAATATCG ATTACATCAA TCCCGCAGAT ATAGAAAGCA TTGAAGTATT AAAAGACCCT TCTTCTATAG CTATTTTTGG TTTGCAGGGA GGAAACGGTG TGATTGTAGT TACCACAAAA CGGGCAGCTA AAGGACAAAC CACCATTAAC CTTCAAAGTT CTGCCGGTGT ACAAAAAGTA TTGAATACCA TTGATGTAAC CAATGCAGCA CAATTCATTA AATTGTACAA CAACCTACTC GCTAATTCCG GATTGGGATC TTACGATTAT ACCAATTATA CTGCAGATAC CGATTGGCAG AAAGAAATCC TCCAATCAGC CTTTCAAAGC AACAATAACC TGAGCATTTC CAATAGCGGT GAAAAATCAA CCACATTGAT CAACCTGGGA TACAATACAA TGGAAGGTGT AGTTAAATTT GGAAAGTATC AAAGATATGT AGCCCGGGTA AACGAGGAAA TAAGAATAAA TGATAACATT AAAATTGGCG CAGATATAAC AGGAACACAC TGGATTCTGA ACAACTCGAG CGGAGATCTG AACAATGCAT TATGGGCAGC CCCGATAGTT GGCATCAGGG AAAGTGAAAC TGCTTATTAT GCCATGCCTG GATTTCAGCG GGGCCAGGTT GGTAATCCGG TAGCCAGGAT TTACCAAAAC GACAGAAACA GCATTAACAA AGGTTACCGG GTAGTCGGAA ACCTTTTCGC CGATGTTACT TTCCTGAAAA AATTTAAATG GAGGTCGCAG TTTTATACCG ACCTTGGTTT CAACAATAAC CGGGGATATA CAGGCCTGCC CTTTACTGTT ATCTATTTAG GAGAAGGCAA TATCCCTACC ACCAGATTCG ACAATCCAAA TGCCCGTACT TCGGTAAGAC AGGGAGCTGA CGAATTCAGA AAGTTTCAGC AGGACCACAC AATAACTTAT GAAAACACCT TTAATAACGA CCATAAGGTA ACAGCTGTTG CAGGTTTTAC TTCTATCTTC AGAAGTGAGA CTAAACTTTC CGGGGACCGG ACAGACATCG GGCTAAATAT TCCAGATGAC CCTGCTTACT GGTACATTGG CATTGCCGAC AAAAGCAATC CAAGTGGTGT AACCGGTAGT GGTGAAGAAA GGGCCTCAAT GGGTTATTTT GCAAGGGTAA ACTATGCTTA TAAAGACAGA TACCTGATAA ATGCCACTTA TAGAAGAGAC GGGCTGTCCA GTATTGCCCC TCAAAACCGC TGGGGTAATT TTGGCGGTAT TGGTTTGGGA TGGGTGCTCT CCGAAGAAAG CTTCTTCAAA AACATTAAAG GTGTTGACTT TTTGAAACTT CGTGGTTCAT GGGGAACAAC AGGTAACGGA CAGGGCTTAC CTCCCAATAT CTTCAGACCA GGTGTTACCA CCTCTGGCTC GGGCGTGTTC GGCGACAACA TTTATCCCGG CATTGCACCT GCCTATATTG CAGATCCAAA CCTGAAATGG GAAGTAGTAA GAGGATTAGA CTTGGGAATG GATCTGAAAG CTTTGAACAG CCGCTTAAGC GCAGAAATTA ATGTATATGA CCGTACAACA AAAGACATCA TTACCCAGAT CACTTTATTG AATACTAGCG GAAGCTATCC TTACCGTACC AATCTGGGCA CTATATCCAA TAAAGGAATT GAGGTGGCCC TGGGGTGGAA TGATAAAATC GGCAGTGATT TCACCTATAA TATCACACCT AATTTTAGTT ACAATAAAAA CGAAGTCGTA TCCATCGGAA ATAACATTAA CTTCCTGCTA ACCGGAAATG GCGGTGCAAA CAGGACCATT ACCGGAGAAT CGATAGGTCA CTTTTATGGA TATAAGCAAA TAGGCATTTA TCAATCGACT GCCGATCTGG ACAAAATGGC CAGGCTATCC AACTCACTCC CTGGTGATAT TGCCTATCAG GATACAGATG GTGATGGTAA GATCACACCT GCCGACCGGA TTAAGCTAGG TTCTCCCTTT CCTGCCTGGA GTTATGGCCT GAACCTGAAC TTAGGCTACA AAGGTTTCGA TGTTTTGTTA CAGGGACAAG GTGTTGCCGG CAATAAAGTA TACACCCAGC GCCGTACCGC AACTTTTGCT GACCTGAACT TTGAAACCAA CAGATTAAAT GCCTGGACAG GGCCAGGAAC CAGTAACGTT GAACCCATTT TACAGAAAGG CAGACTCAAC AACTACCTGT TCAGCAGCTA TTACCTGGAG CCGGGAGATT ACTTCCGTCT GCGTACCGTA CAATTGGGTT ATACCTTTAA ACCAGCTATG CTGGCAAAAG CAGGTGTTAA AAACCTCCGT TTGTATGTGA GTGGACAGAA CATCCATACC TGGACAAAAA CTACCGGATA TTCGCCTGAA GCACCGATCA GTGATGTACT TGGTGGTGGT GCTGACAATG GAGTATATCC AATTCCGGCA GTTTACACAT TTGGTATCAA TGCAACATTT TAA
|
Protein sequence | MKRIFTKLSV LTFLCFLFTS VTQAQDLTVT GVVTDLADKL PLPGVSVQVK GTQKGTTTDA MGKYAISAPA NATLVFTSIG YTSREMQIGN QTTINVVLSS ASQDLEGVVV VGYGTQRKRD LTGAITQIKG DEVAKMPNTN PLSSLQGKVA GLTVVNSGTP GAAPTVRIRG VNSTTSGGNN PLYVVDGVHQ DNIDYINPAD IESIEVLKDP SSIAIFGLQG GNGVIVVTTK RAAKGQTTIN LQSSAGVQKV LNTIDVTNAA QFIKLYNNLL ANSGLGSYDY TNYTADTDWQ KEILQSAFQS NNNLSISNSG EKSTTLINLG YNTMEGVVKF GKYQRYVARV NEEIRINDNI KIGADITGTH WILNNSSGDL NNALWAAPIV GIRESETAYY AMPGFQRGQV GNPVARIYQN DRNSINKGYR VVGNLFADVT FLKKFKWRSQ FYTDLGFNNN RGYTGLPFTV IYLGEGNIPT TRFDNPNART SVRQGADEFR KFQQDHTITY ENTFNNDHKV TAVAGFTSIF RSETKLSGDR TDIGLNIPDD PAYWYIGIAD KSNPSGVTGS GEERASMGYF ARVNYAYKDR YLINATYRRD GLSSIAPQNR WGNFGGIGLG WVLSEESFFK NIKGVDFLKL RGSWGTTGNG QGLPPNIFRP GVTTSGSGVF GDNIYPGIAP AYIADPNLKW EVVRGLDLGM DLKALNSRLS AEINVYDRTT KDIITQITLL NTSGSYPYRT NLGTISNKGI EVALGWNDKI GSDFTYNITP NFSYNKNEVV SIGNNINFLL TGNGGANRTI TGESIGHFYG YKQIGIYQST ADLDKMARLS NSLPGDIAYQ DTDGDGKITP ADRIKLGSPF PAWSYGLNLN LGYKGFDVLL QGQGVAGNKV YTQRRTATFA DLNFETNRLN AWTGPGTSNV EPILQKGRLN NYLFSSYYLE PGDYFRLRTV QLGYTFKPAM LAKAGVKNLR LYVSGQNIHT WTKTTGYSPE APISDVLGGG ADNGVYPIPA VYTFGINATF
|
| |