Gene Phep_1802 details


Gene Information

Locus tag: Phep_1802
Symbol:
ID: 8252905
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pedobacter heparinus DSM 2366
Kingdom: Bacteria
Replicon accession: NC_013061
Strand:
Start bp: 2099199
End bp: 2100989
Gene Length: 1791 bp
Protein Length: 596 aa
Translation table: 11
GC content: 38%
IMG OID: 644935453
Product: Thioredoxin domain protein
Protein accession: YP_003092073
Protein GI: 255531701
COG category: [C] Energy production and conversion; [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG4232] Thiol:disulfide interchange protein
TIGRFAM ID:
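
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based, inclusive positions on the replicon (the record does not state this) and that the gene length includes the stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 2099199
end_bp = 2100989
gene_length_bp = 1791
protein_length_aa = 596

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
codons, remainder = divmod(span, 3)

# Assumption: the last codon is the stop codon and is not counted in the protein length.
expected_protein_length = codons - 1

print(span == gene_length_bp)                         # True
print(remainder == 0)                                 # True
print(expected_protein_length == protein_length_aa)   # True
```

Under these assumptions the span is 1791 bp, which is 597 codons, or 596 residues plus one stop codon, in agreement with the table.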


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.868278
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000561667
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAAAACGC AATTCACAAA ATTGATCATG CTATTGCTAT GCCTGCTTTG TTTTGGGGGA 
AGCAGCTTTG CGCAAGATTT AACAGGAACC TGGCGCTTAA AAAACTTAAA GACGATTAAG
GGCTCTGAAT ACGTCAACGC CCTACCAAAG CAGATGGTGA TCAACCAAAC TCCGGATGGG
GTTGAATTTA AACTGACCAG TAACCTTGGT GACAGAGATA GCGTCATCAG TCAGCTGTTA
AGCTTTAACA GTATCAATGA AAGTAAGACC CACAGTGGAA AGAAAAAACT TGTAACTATT
CAAAAGAAAG CGGATGGTTC CTGGTTGAAA CATACAAAAG TATTTTCCAA TAACTATCCT
AAAGAGCTGC TCGGTACAGA TGACGAAACC TACACCCTGG ATAAGGAAGG GGGCTTGACG
CTTTTGAGGG TATATGATTC GACCGATGAG GTTAAAGCCG GAATTCAGGA TTATACTGCA
GAAGCGAGCT ATGAAAAACT AGATCCAGAA TCGGCTGCCA GGGAGGCTGC TAAAGGAAAA
GGTGTGAATT TTGTGCAAGG ATTGAACTGG GAACAAATCA AAGCAAAGGC AAAAGCTGAA
AACAAATACA TTTTTGTGGA TTGTTATGCC ACCTGGTGTG GGCCATGTAA GGTAATGGAT
ATGGAGGTTT ACCCATTAAA CATGGTAGGA GAAGCCATGA ATGAGCAATT TATTTCCATT
AAAATACAAA TGGATTCGAC TAAAAACGAT TCTCCGAGTG TAAGACCGTT ATATGCTGCA
GCAAGAGAAT TGGAAAAAAA ATACAATATA ACCGGATTGC CCAGTTATCT TTTTTTTAGC
CCAAGTGGTG AAATTATTCA TAAAGATATG GGCGCGCGAA ATCCAGATGA ATTTTTAAAC
CTCCTTAAAG ATGCCATCAA TCCTAATAAA CAGCTTTATT CTTTAATAAA ACAGATACAT
GCCGGGGAGA TGGACGTTAA TCTCATCCCA GGATTTATCA AACATTTAGA AGACAAGGGC
GAAAAAAGTT TATCAACGGA ACTTACTCGG TATTATATGA AAAATTACCT GGAGAAGTTA
CCTGAAAAGG ATTTTCTAAC CAGGAAGAAC CTCGATTTAA TATTTAAATA TCCCCGGACG
CTGATGACGC AAGATAGAAT TTACCAAGCT TTCTGTAATC AGGCTAATAT TGTAGATAGT
TTAATGGAGT ATCCGGGGTT CTCGGACGCT GCTATAAATT GGGTGTTTAG CAATGAATTC
GTACAACCCA CTTTTGATCA GGCAAAATTG AAAGGAATAG CTCCTGATTG GAAACAGATT
CTTGCAACCT TATGGAGTAA GACCACAAAA GAACGTGCAA ATGTTATTAT ACTTAACTAT
AAAGTTGCAT GGTATAAAGG AAAAAAAGAT TGGGATAATT ATGTAAAATA CCTTTTCCAA
CGTACAAAAA ATGAAAATAT TGAAAGTCCT AATCAATCGG TACTAGGGTT AAACTCTACC
GCATGGGATT TATTTGAATA TTCTTTTGAT AAAAAAGCTC TTGAATTAGG TTTGAAATAT
ATTGATAAAT CAATAGCACT TTGGGCCAAA TCAGAAGGAA GCGCCGGACT TTTAGATACA
AAGGCAAATT TATTATATAA ACTGGGAAGA AATGAGGAAG CGATCCTTTT ACAAAAGCAA
GCGGTTTTAA TAAATCCTGC GTCGAAAGGA TTGAAAAAAA CGCTGGAGAA AATGCTGAGT
AAAGAAAAAA CCTGGGAGTT TGGCGCAAAT GAAAATAGGA AAGTTAAATA A
 
Protein sequence
MKTQFTKLIM LLLCLLCFGG SSFAQDLTGT WRLKNLKTIK GSEYVNALPK QMVINQTPDG 
VEFKLTSNLG DRDSVISQLL SFNSINESKT HSGKKKLVTI QKKADGSWLK HTKVFSNNYP
KELLGTDDET YTLDKEGGLT LLRVYDSTDE VKAGIQDYTA EASYEKLDPE SAAREAAKGK
GVNFVQGLNW EQIKAKAKAE NKYIFVDCYA TWCGPCKVMD MEVYPLNMVG EAMNEQFISI
KIQMDSTKND SPSVRPLYAA ARELEKKYNI TGLPSYLFFS PSGEIIHKDM GARNPDEFLN
LLKDAINPNK QLYSLIKQIH AGEMDVNLIP GFIKHLEDKG EKSLSTELTR YYMKNYLEKL
PEKDFLTRKN LDLIFKYPRT LMTQDRIYQA FCNQANIVDS LMEYPGFSDA AINWVFSNEF
VQPTFDQAKL KGIAPDWKQI LATLWSKTTK ERANVIILNY KVAWYKGKKD WDNYVKYLFQ
RTKNENIESP NQSVLGLNST AWDLFEYSFD KKALELGLKY IDKSIALWAK SEGSAGLLDT
KANLLYKLGR NEEAILLQKQ AVLINPASKG LKKTLEKMLS KEKTWEFGAN ENRKVK
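
The GC content and protein sequence reported in this record can in principle be recomputed from the gene sequence. The sketch below is one way to do that in plain Python; it is not the pipeline used to produce this record. The helper names (`clean`, `gc_content`, `translate`) are hypothetical. The codon table is the standard codon-to-amino-acid mapping, which translation table 11 shares with table 1; table 11 differs in its set of permitted start codons, which does not matter here because the gene begins with ATG.

```python
# Codon-to-amino-acid mapping shared by NCBI translation tables 1 and 11,
# listed in TCAG order for the first, second, and third base.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases in the sequence (0.0 for an empty one)."""
    seq = clean(seq)
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    A trailing partial codon is ignored; a codon containing a character
    other than A, C, G, or T raises KeyError.
    """
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three 10-base groups of the gene sequence above.
print(translate("ATGAAAACGC AATTCACAAA ATTGATCATG"))  # MKTQFTKLIM
```

Passing the full gene sequence to `gc_content` and `translate` should give values that can be compared with the GC content field and the protein sequence listed in this record.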