Gene Phep_3172 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_3172 
Symbol 
ID8254291 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp3774855 
End bp3778271 
Gene Length3417 bp 
Protein Length1138 aa 
Translation table11 
GC content42% 
IMG OID644936825 
ProductTonB-dependent receptor plug 
Protein accessionYP_003093429 
Protein GI255533057 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG4206] Outer membrane cobalamin receptor protein 
TIGRFAM ID[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.455515 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTATGAAA ATTTTAGGGT AAGCGCCCCA GCTTTCTGGC AGCGCTATTT AAAACAGCTT 
ATGATGTTAA AGCTAATGTT ATTAATGAGT TTGGTATTTG CTTTTTCTGC AAATGCCAAT
CACGCATTGG CCCAAAGGGC CAGTCTCAAT GTAACGCAAA AATCGCTTAA AGAAGTGTTT
AGGCTACTCA AAGAGCAAAC AGATGTCGAT TTTCTTTTTA CAGAAAGCCA GCTAAAAAAT
GCCCATCCTG TTAGTATTAG CGTAAAGCAA AAAGGACTGA AAGAAATTTT GGCACTCTGT
TTTGAAGGGC AGGGCCTGAG CTATGTACTA AATGGCAATA CGGTGATCGT AAAAAAAGAG
AACAACAGTC CTGTCCTGGT CAATATCCAG CAGCGTACCA TTACCGGTAT CGTAACAGAT
GAAAAGGGAA CTTCCATACC TGGTGTAAAT ATCGAAGTAA AAGGTACCGG CAAACGGGCA
GTAACCAACA GCGACGGGAA GTATTCCATC AGCATTGATA AAGACAATGC GGTGCTTGTT
TTTAGCTATA TGGGCATGGC TACCCAGGAA GTAGCTGTTG CCAGCCGCAA TGTTGTTAAT
GTGGAGCTTA AAGCAGAATC TGCTGATTTG AGCGAGGTGG TGGTGGTAGG TTACGGCACG
CAGAAAAAAG TAAACCTGAC CGGTGCAGTA GCTTCCATTG ACATTGCTAA AGTTGCCGAT
TCCAGGCCAA TTACCAATGT TTCTTCTTTA CTTACCGGAC TTGCACCAGG CTTGTACGTG
AAATCAGGAA ATGCAGATCC GGGTGGCAAT GCCTCACTAC TCATCAGAGG CCAGGGAACA
CTGAACAATT CTGCCCCTTT GGTGATCATT GATGGAGTGG AAGGCGACAT TTCGAGAGTC
TCACCTCAGG ACATTTCTTC TGTTTCGGTA TTGAAAGATG CTTCTTCAGC TGCAATTTAT
GGCTCCCGGG CAGCTAATGG TGTATTGTTG ATTACCACCA AGCAAGGTGT AAAAGGTAAA
TTTTCCATCA GTTATGATGG TTATGCCACC ATGCAGTCAG TAGGACACCT GATGCCATTG
GTAGACAACA GTGTACGGTA TATGGAACTG CTGAATGAAG CTGCAAAAAA TTCGGCTGTA
GCACCGGTAT TTACCGAAGC AAACATCCAG AAATGGCGTG ACAATGCAGG TGGCGATCCA
CTGCTTTGGC CAAATACCAA TTGGGCAGAT GGCCTGTTCA GAGACGTAAC AGCCGTAAAC
CATAATGTTT CGGTATCGGG TGGTACAGAT AAGCTCACTT CATTTATGTC ATTCAATTAT
GCCAATAATC CTGGAATGAT AGAAAATACC GGCTTTCAGC GGTACAGCCT CCGGTCTAAT
ACCCAGCTAC AGGCTACAAG CTGGTTAAAA GTAGGTATGA ATTTAAACGG AACTTATTCT
ACAAAGGATA GGGGAAGTCA GCAATTGGAA GGTATGTTCA TCAACTCCAT TCTTGCCGTA
CCAACCGTAG TTCCAAGACA TCCGGATGGC AGATATGGCG GAACCAATAA TTCGGAAGAA
AACTCTGTCG CGCTGTCGCC TATTTTTTAT GTGAACCAGG TTAAAGGGAC TAATAAAACG
AATACCCTGG TCAGCCGGTT TTATGTAAAC GTAAATCCTA TGGAGGGGCT TAACGTCAAT
GCCTCTTATA ACTATAACTT GTTTGACAAT AAAATTACTA CAATTCCCAC GCAAAACGAC
CGCTGGAATT TCCAGACCAA TACCATTCTG GTACCTGGTG CGGTAGCTTT ATATGTTCAG
AAACAGGACA TCAATAATGT CAGGAACTTT ATGGATTTTG ATGCATCCTA TGAAAAAACG
TTGGATAAGC TGAACATGAA ATTAATGGTT GGCGGAAGTC AGGAAAGGTA TCTTGCTGAA
AACCTGGCGG TAACCAAACA AGGGCTTATC GATGAGAACC TTACGCAACT TAACGGGGCT
ACCGGCGTGG CTACAGCAAC AGGTACATTG GATGGAGACT GGGCCATGCA TTCCTATTTT
GCAAGGTTGA ATATGGCCTG GGATTCCAAA TACCTATTGG AACTGAACAT CAGAAGAGAT
GGATCTTCCA GGTTTACGCA GGCCAACCGC TGGGGAAATT TTCCTTCAGC TTCTGTAGGA
TGGAGAATTT CCGAAGAGGA ATTTATGAAA TCGCTGAAAT CAACCTGGTT AAATGATTTA
AAATTCAGGG CCTCATATGG TGCTTTAGGT AACAATGCAG TGGGCGATTA CGAGTCTATT
TCTGTGTTGT CGTCTGTCCT GTATTCTTTT AACAATGCGC CGGTAAATGG TTTTTATCAA
ACCAGGTTCC CTAACCTCGG TTTATCCTGG GAATCTACCT ATGTAACCAA TTTGGGACTT
GATTTTAGTC TTTTTAGCAA CCGTTTGTCG GGTAATCTGG ATGCCTACAA TAAACTCACC
AAAAACATAT TGATCAGCTT GCCTGCCCCA TTGGTTAATG GTACCATTTC TATACCGCCG
CAAAACAGTG CCGAAGTACA AAACAGGGGA ATAGAGCTGG GCTTAAACTG GAAAGATAAA
ATTGGCGAAG TAGGTTATTT TGTAGGTACT AATTTCACCT TTAATGCAAA TAAGGTACTC
AAATTTAAAG GAAGTGAATA TTCGCTTTCG GGAACTTCAA TGATTAAGGA AGGATTACCT
ATCAATACCC AATACGTTTT ACTGGTAGAT AGAATCATAC AGACGCCAGA GGATGTACAA
TGGGTAGCAG ACAGAATTGC CAATGCCCCC TTTGATCCGA ATGATCCTGA AACAGATCCA
ACCAAGAAGA GAAGGCTCAA TGCTTTTCCG TATGGAAAAC CCGAACTTGG AGATTTCCTG
TTTAAAGATG TAAACGGCGA TGGAATTGTA GATGATAACG ACAGGGCCAA TGTAGGAAAA
GGGTCTAATC CGCAGTTTTT CTATAGCTTC ACTTTAGGGG CCAATTATAA AGGCTTCGAT
TTTTCAGCAA TGATTGATGG AGTAGGCGGA ATAAAAACCT ATTTTCAAAA TGATTATTAC
AACCCCGTGC TGAGGTGGTC CCGCATCATA AATCAGGAAA TAGCAGATGG AAGATGGTAT
CCTGGGCGCA CTACAACGGC TACCTATCCA AGGTTGCTGC TAAACGACAA CAGAAATACC
CGCTCCAGCG ACTTTTGGGT ACAGGATATG TCTTTCCTGA AGATCAGGAA TATTCAATTG
GGCTATTCAT TACAATCTAA TGTATTGTCA AAACTCAAAG CTTCAAAGAT CCGTTTTTAT
GCCACACTTG AAAATTATTT CACCTTTACC AAGTATAAAG GTTTAGATCC GGAAGTTTCC
GGCATGGCTT ATCCGAACGT AAAACAAGCT GTTTTTGGAA TGAATTTAAC TTTTTAA
 
Protein sequence
MYENFRVSAP AFWQRYLKQL MMLKLMLLMS LVFAFSANAN HALAQRASLN VTQKSLKEVF 
RLLKEQTDVD FLFTESQLKN AHPVSISVKQ KGLKEILALC FEGQGLSYVL NGNTVIVKKE
NNSPVLVNIQ QRTITGIVTD EKGTSIPGVN IEVKGTGKRA VTNSDGKYSI SIDKDNAVLV
FSYMGMATQE VAVASRNVVN VELKAESADL SEVVVVGYGT QKKVNLTGAV ASIDIAKVAD
SRPITNVSSL LTGLAPGLYV KSGNADPGGN ASLLIRGQGT LNNSAPLVII DGVEGDISRV
SPQDISSVSV LKDASSAAIY GSRAANGVLL ITTKQGVKGK FSISYDGYAT MQSVGHLMPL
VDNSVRYMEL LNEAAKNSAV APVFTEANIQ KWRDNAGGDP LLWPNTNWAD GLFRDVTAVN
HNVSVSGGTD KLTSFMSFNY ANNPGMIENT GFQRYSLRSN TQLQATSWLK VGMNLNGTYS
TKDRGSQQLE GMFINSILAV PTVVPRHPDG RYGGTNNSEE NSVALSPIFY VNQVKGTNKT
NTLVSRFYVN VNPMEGLNVN ASYNYNLFDN KITTIPTQND RWNFQTNTIL VPGAVALYVQ
KQDINNVRNF MDFDASYEKT LDKLNMKLMV GGSQERYLAE NLAVTKQGLI DENLTQLNGA
TGVATATGTL DGDWAMHSYF ARLNMAWDSK YLLELNIRRD GSSRFTQANR WGNFPSASVG
WRISEEEFMK SLKSTWLNDL KFRASYGALG NNAVGDYESI SVLSSVLYSF NNAPVNGFYQ
TRFPNLGLSW ESTYVTNLGL DFSLFSNRLS GNLDAYNKLT KNILISLPAP LVNGTISIPP
QNSAEVQNRG IELGLNWKDK IGEVGYFVGT NFTFNANKVL KFKGSEYSLS GTSMIKEGLP
INTQYVLLVD RIIQTPEDVQ WVADRIANAP FDPNDPETDP TKKRRLNAFP YGKPELGDFL
FKDVNGDGIV DDNDRANVGK GSNPQFFYSF TLGANYKGFD FSAMIDGVGG IKTYFQNDYY
NPVLRWSRII NQEIADGRWY PGRTTTATYP RLLLNDNRNT RSSDFWVQDM SFLKIRNIQL
GYSLQSNVLS KLKASKIRFY ATLENYFTFT KYKGLDPEVS GMAYPNVKQA VFGMNLTF