Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Phep_0732 |
Symbol | |
ID | 8251820 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | - |
Start bp | 851666 |
End bp | 853732 |
Gene Length | 2067 bp |
Protein Length | 688 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 644934381 |
Product | TonB-dependent receptor |
Protein accession | YP_003091016 |
Protein GI | 255530644 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1629] Outer membrane receptor proteins, mostly Fe transport |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.244967 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAACAT TTTTTACCGG CTGTTTAACT GCACTTTGTT TTTCAGGGAT AGCCCAGGTC AAATCCGACA GCACAAAAAA ACTTAATGAG GTTGTGATCC GCCCCTACTT CTCTGCCCAG CCACTGATCC GGTCAACAGG AGCCATTGGG GTACTGGATC AGACCACACT GGACAAACAG CCTGCCGGAT CCTTCGTCTC TGCCATCAAT ACCATTTCCG GTGTACGGAT GGAAGAACGC TCTCCGGGCA GTTACCGTCT GTCTATCCGC GGCAGCTTAT TGCGCTCTCC TTTTGGGATA AGAAATGTAA AAATCTATTT TGACGATTTT CCTTTAACAG ATGCGGGTGG CAACAGCTAC TTAAATGCGC TGGATGTATC GGCAGCCTCC GGAATACAGG TATTAAAAGG TCCGCACAGC AGTATATTTG GCGCCAATTC GGGCGGTGTA ATCCTGATCC AGCCGCGGGG CAGTCAGCCC GACAGCACTG CGCTGTCTTT AAACCTGGAA GGCGGCTCTT ATGGTGCCTT CAGGGAAGGC CTGTCCCTAA ACCAGCAGTT CAACAAATAC AGCCTGAACA TTACCCAGGC CTATCAGCGC AGCGATGGCT ACCGCGACCA TAGTGCTATG GACCGCAAGT ATTTTCAAAC CCTGCACAAA TGGGACTATA CTAAAAATGC CGCACTGAAG GCTTTGTTAT TTTATTCCGA CTTACATTAC AATACTCCCG GTGGCCTTAC CCCAGCCGAA TATCTGCAAA ACCCTAAACT CTCCAGGCCC GCTGCCGGCC CTTCAAAAAG TGCAATAGAA CAGCAGGCGG GCATTTACAG CAAAACCCTT TATGGGGGGC TTTCCCATAA CTGGCAGCTC AGCGATCGGT TTAAACATAT AGTCTCCATA TTTTCTTCTT ATACCGATTT TAAAAATCCC TTCATCAGCA ATTACGAAAA ACGCAAAGAG TTTACCCTGG GCTTACGCAC CTTCCTGGAA TATGAAAAGA AGCTAACTCA AGCAAACTGG AAATTTGACC TGGGCATAGA AAGTATGGAA ACCCGTTCAG ACATTAGCAA TTACGACAAT AACAAGGGGC TGACTGCCGC ACTGAAAGCC TCAGATAAAT TAAAGGCTGC TTCCAGTTTT GCTTTCGCAC ACTTAAGTAT CGATGTACGG CAGAAATGGC TGTTCGAACT TTCGGCAAGC GCCAATTTAT ACCAATATGC CTATCAGAGC ATTGCCCCTG TGGCCATTGC CGAACGAAAA AACAGCTTTG ATACCCAGTT TATGCCAAGG GCAGCACTAT CTTATCTGAT CGGCACACAG TTTTCAGTCA GAACTTCCAT AAGCAAAGGT TATTCAGCCC CTAGTCTGGC CGAAATAAGA GCCTCCAACA ATGTGATCAA TGTAGATCTG CAGCCAGAAT ATGGCTGGAA TTACGAAGCT GGACTAAGGT ACCAGGCCCT GAACAACAGG CTGCTCATAG ATGTTACTGC TTTCTATTAC AACCTGAAAA ATGCAATAGT AAGACGTCTG GACCAAAATG ATGCAGGATA TTTTATCAAT GCCGGCGGTA CAAAACAACC TGGCCTGGAA AGCACAGTTT CATTCTGGCC GATACCTACC CGTACCTCAG GCGCTGTAAG AGGTTTGCAA CTCCGCAATA CCTATACCCT TAGTCGCTTT AAATTTGACA ACTACATCGA CAAAACCAAT AACTTTTCGG GCAATGCCCT TACCGGAGTA CCTAAAACCA TGTTGGTTAG CAGCGCCGAC ATACAATTGC CCAATCAGGT TTACATCTTT CTGCAGCATA GCTTTACTTC GCAGATCCCG CTAAATGATG CCAATACGGC CTATGCAAAA AAATACCACA TCCTGCAGGC CAAAATCAGC TGGAAAAATT TACGTATCGG CCGTACTCCT GCAGAACTCT TTACCGGTGC AGACAATATA CTCAACCAAA GATACAGCCT GGGCAATGAC CTGAATGCCT TTAACGACCG CTATTACAAT GCAGCCGCCA AACGTACTTT TTATGCCGGC CTGCTGCTCC GGCTTAACCA CTTATAA
|
Protein sequence | MKTFFTGCLT ALCFSGIAQV KSDSTKKLNE VVIRPYFSAQ PLIRSTGAIG VLDQTTLDKQ PAGSFVSAIN TISGVRMEER SPGSYRLSIR GSLLRSPFGI RNVKIYFDDF PLTDAGGNSY LNALDVSAAS GIQVLKGPHS SIFGANSGGV ILIQPRGSQP DSTALSLNLE GGSYGAFREG LSLNQQFNKY SLNITQAYQR SDGYRDHSAM DRKYFQTLHK WDYTKNAALK ALLFYSDLHY NTPGGLTPAE YLQNPKLSRP AAGPSKSAIE QQAGIYSKTL YGGLSHNWQL SDRFKHIVSI FSSYTDFKNP FISNYEKRKE FTLGLRTFLE YEKKLTQANW KFDLGIESME TRSDISNYDN NKGLTAALKA SDKLKAASSF AFAHLSIDVR QKWLFELSAS ANLYQYAYQS IAPVAIAERK NSFDTQFMPR AALSYLIGTQ FSVRTSISKG YSAPSLAEIR ASNNVINVDL QPEYGWNYEA GLRYQALNNR LLIDVTAFYY NLKNAIVRRL DQNDAGYFIN AGGTKQPGLE STVSFWPIPT RTSGAVRGLQ LRNTYTLSRF KFDNYIDKTN NFSGNALTGV PKTMLVSSAD IQLPNQVYIF LQHSFTSQIP LNDANTAYAK KYHILQAKIS WKNLRIGRTP AELFTGADNI LNQRYSLGND LNAFNDRYYN AAAKRTFYAG LLLRLNHL
|
| |