Gene Phep_2079 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_2079 
Symbol 
ID8253183 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp2395820 
End bp2398906 
Gene Length3087 bp 
Protein Length1028 aa 
Translation table11 
GC content45% 
IMG OID644935727 
ProductTonB-dependent receptor 
Protein accessionYP_003092346 
Protein GI255531974 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.297647 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAAAATT TTAAAAAGGA GGGGTATGTA TCTTATACCG GTTCCTTGTT AAAAAAACTA 
TTGCTGCTAA AAGGCAGTAT ACTGGTACTC TCATTGTTAC CCTTTGTAGC TGCCGCGCAG
ACCCAGCCAC TTATCAATTC AACACTTAAA GGACAGGTAG TAGATTCACT TACAAAACAA
GGGATCCCGG GCGTAAGCAT ACACATTACA GGAACTACCC ATACTGTGCA AACGGACGGA
GAGGGCAGGT TTGATTTTGT AACCGGACAA AAATTTCCTT ATACGCTTGA AATCAGACAT
GTTTCCTATA GTAAGAAAGA TGTTGTCGCC AATGGCAGCC CAATACTGGT GAAATTAAAG
GAGCTCACCG GTCAGCTCAA TGATGTGGTC ATTGTAGGTT ATGGTACCCA GCAGCGTAAA
GATCTGATTG GTTCAGTTTC CAAGATAGAC CCCTCAGGTA CAAAAAATAT TCCCGAAGCA
GGCTTCGATT CCCAGTTACA GGGCAAGGCT GCTGGTGTAC AGATCAATTC AAATACAGGT
GTACCCGGTT CGGATATATT TATCAGGGTA AGGGGCGCAA CTTCCATTAA TGCCACCAAT
GATCCGCTGT ATATAATAGA TGGCGTTTGG GTAAATAACA GCAGTCTCCA GAACATTGCA
CAGGAGCGGG GTACCTCTCC ACTTTCTGAC ATTAACCCTG CTGATATAGA AAGTATTGAA
ATATTGAAAG ATGCCACCGC TATCGCTATT TATGGATCCA GAGGAGCAAA TGGTGTGGTG
CTGGTCACGA CAAAAAGGGG CAATTACGGG CAAAAAACAA AGATAGATTT TAATGGATCG
GAAGGTTTGG GCAGGGCCCC GGCAGACAGG GTATGGAAAA CCACCACCGG ACCCGAGCAC
GCCCGTCTGA TTAATGAATA CAGCAGAAAT ATGGGCAAGC CTGAGCCATT CAGGCCGGTA
ACTGAAGTCA TTAACGGCGT TGCCGGCAGG GGATTGCCTG ACGGACAGCC AACTTACGAC
CGCATGAGCT ATTTAAACCG TACGGCCACA CTCCGCAATT ACGATCTCTC GCTGCAGGGC
GGATCTGACA GAACACGCTT TTATCTTGGG GGTGGTTATA CCTTTCAGGA ATCAATCTGG
AAACCCATGT CTTTCGACCG TGCAGGTTTA AAAGTTAACC TCGACCATAA ACTGAGCTCA
AAAATTGCAA TCGGTACCAG CAATACCATT TCAAGAAGTC ACCGCGACCA GGCACGTCCT
GCAAATGGTG CAAACGGAAC ACTGCTGCAG GCCTCATTGA ACATACCTAC CTATCTTCCC
ATCTTTGATG CCAATGGAAC TCCTTTAAAA TGGGTTAATT TCGACAATAT CGATGTGCTG
ACCAGTACAG TAAACCTATG GTCAAACAGT TATCATTATA TTGGCAATAT TTACCTGGAT
TATGCCATCA CACCCAAATT AAAGTTCCGT TCTACCTTTG GTGTCGATTA CAACAACTAT
GAAGAGAACG AATACTGGGA TACCCGGACC ATTCTTGGTA ACAGTGGGGG AAGGGGTACA
CAAAGCATCA CCCAGTCTTC CACAGCTATA AACGAACAGA CCCTGGCCTA CAATGATAAG
GCCGGGAAAC ATAGTTTTGG CATACTGATC GGTAATACCC TGCAGGGTGC CGAAGTGAAG
AACGTATCGG CCACAGGTAC AAATTTTCCA AACAATTCTT ACACACAGAT TTCCTCGGCC
GCCACACAAA TTGCATCGCA GTTTAAAACA AACAGTACCC TGGCCTCTTT TTTCTCGAGG
GCAGATTATA ATTATGCCGG TAAATATTAT GCAGAGTTTA CAGTAAGGGC AGATGGTTCT
TCCAAATTTG GTAAAAACAA TAAATGGGGC TATTTTCCTG CAATCGGTGC TGCATGGCGT
ATCAAAGAAG AAAATTTCCT TAAAAACGTT TCCCAGATCA GTAACTTAAA ATACAGGGTC
AGTTATGGCA TCACCGGAAA CCAGGCCGGT ATCAATGATT TTGCTTCTCA GGGCCTTTGG
ACAGGGGGTT TTGGCTATGC TGATGTGGCC GGCGGGGCCG AATTACCTGG TACAGCCCCT
TTACAGCTGG CCAATCCCGA CCTGAAATGG GAGAGTACGG CCCAGTTCAG CACCGGACTT
GACATTGGCT TGTTCGAAGA CAAACTGAAC ATAGAGTTTA ATTATTATAA CAAATATACC
AGGGATGCTT TATTGCAGGT TGCTGTTCCG GGTACATCTG GTTTTTCTTC CTATCTCACC
AATTATGGCG AGATCAGTAA TAAAGGATTT GAATTGTCTA TCAGTTCCAT AAACCTCAGG
ACAACCGGCT TTACCTGGAA AACGGATTTT AACATTGCCA GAAATAAGAA TACCATCGAA
AAGATACCTG CCGACATCCC CTTTGCAGGA AGGGACCTCA TCCGTTTGCA GCAAGGAAAG
GAATTGTATT CCTACTGGTT ATATAAACAG CTGTATGTGA ATCCTGATAA TGGTGAAGCG
GTATTTGACG ATTATAATAA GGATGGTAAA ATAACTGCCG ACGACCGTCA GATTGTTGGT
AGTACCTGGC CTAAGTTTTT TGGTGGGCTT ACCAATAATT TCAGCTATAA AGGGTTGGAC
CTTAGTATCT TTTTTACTTT CTCTTATGGA AACTATTTAT GGAACCACAA CAGGATGCTC
GGAGAAACAG GGGGTACACT GGATGCGGGC CGGGTATTGC TGAAAAGCCA GCTGGACCGC
TGGACTACAC CAGGACAGGT TACAAACACC CCAAAACTTA ACGACGCCAA TTATGCCAGG
CAAGAGAACA GCCGCTTTTT TGAAGATGCT TCATTTTTAA GGTTACGTGC TTTAACCCTT
GGCTACACCC TGCCTGCAGT GTTTACTGAG AGGGTCAGGA TCCGGAAAGT GAGATTTTAT
GTAAGTGGCA GTAACCTGTT GCTATTTACG AAATATACAG GTGCAGATCC TGAATCAAAC
CTGGGCACCC AGAACATACA AGGTTATGAC TATGGTACCC CGCCACAACC ACGTACCGTA
CAACTGGGCC TTAACCTTAC ACTTTAA
 
Protein sequence
MQNFKKEGYV SYTGSLLKKL LLLKGSILVL SLLPFVAAAQ TQPLINSTLK GQVVDSLTKQ 
GIPGVSIHIT GTTHTVQTDG EGRFDFVTGQ KFPYTLEIRH VSYSKKDVVA NGSPILVKLK
ELTGQLNDVV IVGYGTQQRK DLIGSVSKID PSGTKNIPEA GFDSQLQGKA AGVQINSNTG
VPGSDIFIRV RGATSINATN DPLYIIDGVW VNNSSLQNIA QERGTSPLSD INPADIESIE
ILKDATAIAI YGSRGANGVV LVTTKRGNYG QKTKIDFNGS EGLGRAPADR VWKTTTGPEH
ARLINEYSRN MGKPEPFRPV TEVINGVAGR GLPDGQPTYD RMSYLNRTAT LRNYDLSLQG
GSDRTRFYLG GGYTFQESIW KPMSFDRAGL KVNLDHKLSS KIAIGTSNTI SRSHRDQARP
ANGANGTLLQ ASLNIPTYLP IFDANGTPLK WVNFDNIDVL TSTVNLWSNS YHYIGNIYLD
YAITPKLKFR STFGVDYNNY EENEYWDTRT ILGNSGGRGT QSITQSSTAI NEQTLAYNDK
AGKHSFGILI GNTLQGAEVK NVSATGTNFP NNSYTQISSA ATQIASQFKT NSTLASFFSR
ADYNYAGKYY AEFTVRADGS SKFGKNNKWG YFPAIGAAWR IKEENFLKNV SQISNLKYRV
SYGITGNQAG INDFASQGLW TGGFGYADVA GGAELPGTAP LQLANPDLKW ESTAQFSTGL
DIGLFEDKLN IEFNYYNKYT RDALLQVAVP GTSGFSSYLT NYGEISNKGF ELSISSINLR
TTGFTWKTDF NIARNKNTIE KIPADIPFAG RDLIRLQQGK ELYSYWLYKQ LYVNPDNGEA
VFDDYNKDGK ITADDRQIVG STWPKFFGGL TNNFSYKGLD LSIFFTFSYG NYLWNHNRML
GETGGTLDAG RVLLKSQLDR WTTPGQVTNT PKLNDANYAR QENSRFFEDA SFLRLRALTL
GYTLPAVFTE RVRIRKVRFY VSGSNLLLFT KYTGADPESN LGTQNIQGYD YGTPPQPRTV
QLGLNLTL