Gene Phep_3497 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_3497 
Symbol 
ID8254617 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp4162957 
End bp4166067 
Gene Length3111 bp 
Protein Length1036 aa 
Translation table11 
GC content44% 
IMG OID644937147 
ProductTonB-dependent receptor plug 
Protein accessionYP_003093750 
Protein GI255533378 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.124606 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGAAAAC ACTTTTACAA AACGTGGTAT AGTTACAAAC CATGCAGTTT AATTTTAACG 
CTGCTCCTGT TTGCAAATTT TGCTTTTTCG CAGGATAAGA CAGGATTGAT CAAAGGAAAG
ATAACTGATG ATTCTGGGCC TTTGCCGGCG GCATCGGTTT TGGTAAAAGG ACCTAAAAAC
GGAACAACTG CTGATGCGCA GGGTAATTTT AGCATCAGGG TTGCAGAAGG CACTTATACA
CTTACTGCCT CGTACATTGG TTATACTTCA GCAGATCAGG TGAATGTTAA AGTAACAGCC
GGCAAAGAAA CGGTGGTTAA TTTTAAACTG TCGTCAAATG TACAGCTGCA GGAAGTTCAG
GTAAGTTATG GTAAGCAAAG GGCCAGAGAG ATTACCGGTT CTGTTGCACA GGTAGATGCC
TCTACTTTGC AGGACATGCC GGTAAACCAG TTTGCACAGC AGCTGGCCGG TAAGGTTGCG
GGGGTGCAGG TGGCCCAGAC CAGCGGACAG CCAGGCCGGG GAATGTCCTT CCGTATCCGT
GGGGCTACTT CAACCAAGGC CGATAATCAG CCTTTGTTTG TGGTAGACGG GATGCCGGTT
ACAGGGAGTA TCAACAACAT TAATCCTGCA GAGATCGAAT CTTTCTCTAT CTTAAAAGAT
GCTTCTGCCA CGGCTTTGTA TGGCTCGCGC GCAGCAAACG GGGTTATCCT GATCACTACC
AAACATGCCA AATCTGGTGA TGCCAAGATC GAGTTCAATG CCAATTATGG TGTGCAGAAG
ATCCCGGAAA ACCGGGTTCC CCAAATGATG AATGCCCGGG AATTCGCTAC TTATATGAAA
AACAGGTACG AGGACCAGAA AATATATGTG CCTACATTTG TTCCGCCAGC GGATCAGGTA
GCTGCTTATG GCAATCCTGA GCAATATGGC GAAGGTACCA ACTGGTTTAA ACTGATGACC
AGGACAGCGC CCATACAGGA TTATAATGTA AGTTTTCAGT CGGCCCGCGA AAAATCTTCT
TCTACGGTTA TTGCAGGGTA TCTGGAACAG CAGGGAGTAC TGATCAATAC TTCAACCAAA
CTGTACTCTT TACGTTATAA TGCAGACCTT TCTTTAAGCA ATAATAAATT AAAGATAGGG
GTTAACGTAG CACCAAGTTA TCGACTGGAC CACAACAACA GGGTAAGTAC CGATGGTGTT
GGCGGATGGT TTGAGAACGG TATGGAGGCC AGTCCGTTGG AAACTCCTTA TAACGCCGAT
GGTTCTTTAA AAAGATTTGT AAAATCGCCG GGTATGGTAG ATTACATCAA CCCGGTAGCC
AGGTTTCTAG GTACCAAAGA CGATTATATC ACTACCCGGA TACTGGGAAA TGCTTATTTA
AACTATGAAT TTTTACAGGG CTTAAGCTTA AAAACGAATT TTGGTGTAGA TAAAGGTTTT
GAAACCCGCA ACAATTTTAT ATCAGGAGTA ATTGCACCAA CACAAGGAGT ACCTACTGCT
ATCAACCAAT CTTTTGATAA CGGATCGTAC ACAGCGGAAG CAAACCTGGT ATACAATAAA
ACCTTTGGGA CCGATCATCA CATTGAGGCC CTGGGTGGAT ATTCCGTACA GCAGTACAGG
GGATATAGTG CGACCATCAA CGGTACCAAT TTTCCAAGTG ACGATATCCA GTATCTTTCT
GCTGCTACCA GCATTACATC GGCTACAAGT GGTTTAAGTG AATATTCCCT GATGTCGACT
ATTGGCCGGT TAAATTATAA TTATAAGGGT AAATACCTTT TGTCTGGAGC CATACGCCGT
GATGGTTCAT CAAGGTTTGG GGCAAATAAA AGGTATGGTA CTTTCCCTTC TGTATCGGCA
GGTTGGGTAA TGAGTGATGA AAATTTTATG AAGGGCTTTA ATTTTGTAGA TCTGTTTAAG
ATACGTGCAA GTTATGGTAT TGTAGGAAAC AATGCATTCG GAAATTACGA AGCGCTGGCC
ACTATGGGGC AATCCAATTA TATTTTAAAT GGCGCATTGG CTTCCGGGCA GGCCATTACC
CGTTTAGAGA ATGCTGAGCT TGCCTGGGAA CGCAACAAAC AGTTTGATGT GGGCTTTGAT
CTTTCTGTTT TAAAGAACCG GGTAAGTATC ACTTATGATT ATTACCATAA GGTTTCGGAC
GGACTGATCC AGGACAGGCC TATTCCATGG GCATCCGGCT TCAGGTCGAT CTTATTTAAT
GTTGGGGAGA TTGAATTCTG GGGGCATGAG TTTTCTGTAA ATTCCGAGAA TTTAACGGGT
AAATTAAAAT GGAACTCCAA TTTTAACATC TCATTTGATA GGAACATCAT CAACAACCTG
GTAGCACCTG GCTTTGTAAG AAGAAATACC GGTGTATCTT CCGATTACTT TCGCCAGCAG
ATTGGTCACC GCTTAGGTGA GTTTTATGGC TTTGTGTTCC AGGGCCTCTA CACGGCTGCT
GAAATAGCAG ATCCTACAGT AGCCAAGTAC AGGAATTCTG CCGAAGGTAC CTTGAAAATG
AAAGACATCA GCGGACCAAA TGGTGTTCCG GATGGCATCA TCTCTGATGA ATACGACCGT
ACTTTTATCG GCGACCCGAC CCCTGATTTC AATTTTGGGT TTACCAACAA TTTTACCTAC
AAAAACTTCG ACCTTAACAT TACTATGGCC GGCTCTGTTG GTGGCAAGTT GCTGAATGCG
GCCAAATGGG CTTATGCAAC CAATTTGGAC GGTTCGAGGG TGATGCTGAA AGCTGCTGCA
GACCACTGGA GATCGGCAGA TAACCCCGGT TCAGGAATTT ATCCTACAAC CCGATATAGT
ACAACAGATA TGGGGCGTCA GGTGAACAGC CAGTGGGTAG AGAGCGGATC TTATCTGGCA
GCAAAAAACA TTTCACTGGG CTACCGTTTC AATATGAAAG GTAAAACGCT GCTCCAGAAT
TTCAGGGTTT ATGCATCTGT ACAACAGGCT TTTGTAATAA CAGGCTATAG TGGGATGAAC
CCGGAGATCA GCTTTGACGG CACCGATGCT TTTAAAGGAA TCGGGGTAGA TGAAAATGGC
TATCCGGTTC CAAGAACATT TTCAATCGGT ATTTCAACAA CATTCAGATA G
 
Protein sequence
MRKHFYKTWY SYKPCSLILT LLLFANFAFS QDKTGLIKGK ITDDSGPLPA ASVLVKGPKN 
GTTADAQGNF SIRVAEGTYT LTASYIGYTS ADQVNVKVTA GKETVVNFKL SSNVQLQEVQ
VSYGKQRARE ITGSVAQVDA STLQDMPVNQ FAQQLAGKVA GVQVAQTSGQ PGRGMSFRIR
GATSTKADNQ PLFVVDGMPV TGSINNINPA EIESFSILKD ASATALYGSR AANGVILITT
KHAKSGDAKI EFNANYGVQK IPENRVPQMM NAREFATYMK NRYEDQKIYV PTFVPPADQV
AAYGNPEQYG EGTNWFKLMT RTAPIQDYNV SFQSAREKSS STVIAGYLEQ QGVLINTSTK
LYSLRYNADL SLSNNKLKIG VNVAPSYRLD HNNRVSTDGV GGWFENGMEA SPLETPYNAD
GSLKRFVKSP GMVDYINPVA RFLGTKDDYI TTRILGNAYL NYEFLQGLSL KTNFGVDKGF
ETRNNFISGV IAPTQGVPTA INQSFDNGSY TAEANLVYNK TFGTDHHIEA LGGYSVQQYR
GYSATINGTN FPSDDIQYLS AATSITSATS GLSEYSLMST IGRLNYNYKG KYLLSGAIRR
DGSSRFGANK RYGTFPSVSA GWVMSDENFM KGFNFVDLFK IRASYGIVGN NAFGNYEALA
TMGQSNYILN GALASGQAIT RLENAELAWE RNKQFDVGFD LSVLKNRVSI TYDYYHKVSD
GLIQDRPIPW ASGFRSILFN VGEIEFWGHE FSVNSENLTG KLKWNSNFNI SFDRNIINNL
VAPGFVRRNT GVSSDYFRQQ IGHRLGEFYG FVFQGLYTAA EIADPTVAKY RNSAEGTLKM
KDISGPNGVP DGIISDEYDR TFIGDPTPDF NFGFTNNFTY KNFDLNITMA GSVGGKLLNA
AKWAYATNLD GSRVMLKAAA DHWRSADNPG SGIYPTTRYS TTDMGRQVNS QWVESGSYLA
AKNISLGYRF NMKGKTLLQN FRVYASVQQA FVITGYSGMN PEISFDGTDA FKGIGVDENG
YPVPRTFSIG ISTTFR