Gene Information |
Locus tag | Phep_1802 |
Symbol | |
ID | 8252905 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | + |
Start bp | 2099199 |
End bp | 2100989 |
Gene Length | 1791 bp |
Protein Length | 596 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 644935453 |
Product | Thioredoxin domain protein |
Protein accession | YP_003092073 |
Protein GI | 255531701 |
COG category | [C] Energy production and conversion; [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG4232] Thiol:disulfide interchange protein |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.868278 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0000561667 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
Sequence |
Gene sequence | ATGAAAACGC AATTCACAAA ATTGATCATG CTATTGCTAT GCCTGCTTTG TTTTGGGGGA AGCAGCTTTG CGCAAGATTT AACAGGAACC TGGCGCTTAA AAAACTTAAA GACGATTAAG GGCTCTGAAT ACGTCAACGC CCTACCAAAG CAGATGGTGA TCAACCAAAC TCCGGATGGG GTTGAATTTA AACTGACCAG TAACCTTGGT GACAGAGATA GCGTCATCAG TCAGCTGTTA AGCTTTAACA GTATCAATGA AAGTAAGACC CACAGTGGAA AGAAAAAACT TGTAACTATT CAAAAGAAAG CGGATGGTTC CTGGTTGAAA CATACAAAAG TATTTTCCAA TAACTATCCT AAAGAGCTGC TCGGTACAGA TGACGAAACC TACACCCTGG ATAAGGAAGG GGGCTTGACG CTTTTGAGGG TATATGATTC GACCGATGAG GTTAAAGCCG GAATTCAGGA TTATACTGCA GAAGCGAGCT ATGAAAAACT AGATCCAGAA TCGGCTGCCA GGGAGGCTGC TAAAGGAAAA GGTGTGAATT TTGTGCAAGG ATTGAACTGG GAACAAATCA AAGCAAAGGC AAAAGCTGAA AACAAATACA TTTTTGTGGA TTGTTATGCC ACCTGGTGTG GGCCATGTAA GGTAATGGAT ATGGAGGTTT ACCCATTAAA CATGGTAGGA GAAGCCATGA ATGAGCAATT TATTTCCATT AAAATACAAA TGGATTCGAC TAAAAACGAT TCTCCGAGTG TAAGACCGTT ATATGCTGCA GCAAGAGAAT TGGAAAAAAA ATACAATATA ACCGGATTGC CCAGTTATCT TTTTTTTAGC CCAAGTGGTG AAATTATTCA TAAAGATATG GGCGCGCGAA ATCCAGATGA ATTTTTAAAC CTCCTTAAAG ATGCCATCAA TCCTAATAAA CAGCTTTATT CTTTAATAAA ACAGATACAT GCCGGGGAGA TGGACGTTAA TCTCATCCCA GGATTTATCA AACATTTAGA AGACAAGGGC GAAAAAAGTT TATCAACGGA ACTTACTCGG TATTATATGA AAAATTACCT GGAGAAGTTA CCTGAAAAGG ATTTTCTAAC CAGGAAGAAC CTCGATTTAA TATTTAAATA TCCCCGGACG CTGATGACGC AAGATAGAAT TTACCAAGCT TTCTGTAATC AGGCTAATAT TGTAGATAGT TTAATGGAGT ATCCGGGGTT CTCGGACGCT GCTATAAATT GGGTGTTTAG CAATGAATTC GTACAACCCA CTTTTGATCA GGCAAAATTG AAAGGAATAG CTCCTGATTG GAAACAGATT CTTGCAACCT TATGGAGTAA GACCACAAAA GAACGTGCAA ATGTTATTAT ACTTAACTAT AAAGTTGCAT GGTATAAAGG AAAAAAAGAT TGGGATAATT ATGTAAAATA CCTTTTCCAA CGTACAAAAA ATGAAAATAT TGAAAGTCCT AATCAATCGG TACTAGGGTT AAACTCTACC GCATGGGATT TATTTGAATA TTCTTTTGAT AAAAAAGCTC TTGAATTAGG TTTGAAATAT ATTGATAAAT CAATAGCACT TTGGGCCAAA TCAGAAGGAA GCGCCGGACT TTTAGATACA AAGGCAAATT TATTATATAA ACTGGGAAGA AATGAGGAAG CGATCCTTTT ACAAAAGCAA GCGGTTTTAA TAAATCCTGC GTCGAAAGGA TTGAAAAAAA CGCTGGAGAA AATGCTGAGT AAAGAAAAAA CCTGGGAGTT TGGCGCAAAT GAAAATAGGA AAGTTAAATA A
Protein sequence | MKTQFTKLIM LLLCLLCFGG SSFAQDLTGT WRLKNLKTIK GSEYVNALPK QMVINQTPDG VEFKLTSNLG DRDSVISQLL SFNSINESKT HSGKKKLVTI QKKADGSWLK HTKVFSNNYP KELLGTDDET YTLDKEGGLT LLRVYDSTDE VKAGIQDYTA EASYEKLDPE SAAREAAKGK GVNFVQGLNW EQIKAKAKAE NKYIFVDCYA TWCGPCKVMD MEVYPLNMVG EAMNEQFISI KIQMDSTKND SPSVRPLYAA ARELEKKYNI TGLPSYLFFS PSGEIIHKDM GARNPDEFLN LLKDAINPNK QLYSLIKQIH AGEMDVNLIP GFIKHLEDKG EKSLSTELTR YYMKNYLEKL PEKDFLTRKN LDLIFKYPRT LMTQDRIYQA FCNQANIVDS LMEYPGFSDA AINWVFSNEF VQPTFDQAKL KGIAPDWKQI LATLWSKTTK ERANVIILNY KVAWYKGKKD WDNYVKYLFQ RTKNENIESP NQSVLGLNST AWDLFEYSFD KKALELGLKY IDKSIALWAK SEGSAGLLDT KANLLYKLGR NEEAILLQKQ AVLINPASKG LKKTLEKMLS KEKTWEFGAN ENRKVK
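
The length fields in this record can be cross-checked against the coordinates. The sketch below is illustrative only: the helper names are hypothetical, and it assumes the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon (consistent with an unspliced gene, but not stated in the record).

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: number of codons minus one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def clean(seq: str) -> str:
    """Remove the spaces that group the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


length = gene_length(2099199, 2100989)
print(length)                           # 1791
print(expected_protein_length(length))  # 596
```

Passing the full gene sequence string from the record to `gc_percent` gives a value that can be compared with the reported GC content.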