Gene Phep_3406 details


Gene Information

Locus tag: Phep_3406
Symbol:
ID: 8254525
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pedobacter heparinus DSM 2366
Kingdom: Bacteria
Replicon accession: NC_013061
Strand:
Start bp: 4054805
End bp: 4057825
Gene Length: 3021 bp
Protein Length: 1006 aa
Translation table: 11
GC content: 44%
IMG OID: 644937058
Product: TonB-dependent receptor plug
Protein accession: YP_003093662
Protein GI: 255533290
COG category:
COG ID:
TIGRFAM ID:
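
The coordinates and lengths listed for this gene can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the gene record for Phep_3406.
START_BP = 4054805
END_BP = 4057825
GENE_LENGTH_BP = 3021
PROTEIN_LENGTH_AA = 1006


def span_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one, assuming a single terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))            # 3021
    print(expected_protein_length(GENE_LENGTH_BP))  # 1006
```

Under those assumptions, both derived values agree with the Gene Length and Protein Length fields of the record.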


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 38
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATTACTG AGGCAAAAGC AGAAGACTTC TTTTACCAGG AAGCGCGCGC TAAGCTGATA 
ACAGGTAAAG TAATCGATGA GAATAAACTG CCACTAATCG GTGTATCCGT TAAAATTAAA
GGCGGAACTG GTGGAGCCAT TACAAATGCA GAAGGGGATT TTACCCTGAG TGTTAAAAGT
GAAAAGGATT CCATACAGTT CTCTTATATC GGCTACAAAA CCCAGACGAT CAGTGCTGGC
GTTACCGGAT TTGTCACCAT CCAAATGACT CCAAGGTCTA AAGAAAACCT GGACGAAGTT
GCCATTGTTG GTTACGGAAC GCAGAAAAAG GTTTCTGTAA CAGGCGCGAT CAGTACCATT
TCCGTATCTG AAATGCAGAA AGTATCCACT CCCTCCTTAT CTAATGCGAT AGCAGGTAAG
CTCCCCGGTA TTATTACCAG GCAGGCTACC GGAGAGCCTG GATATGATGC GGCTGCCATC
TACATCCGCG GCCTTTCTAC GTTTGGTCAA AATGCGCCAT TGGTGCTGAT AGACGGTGTA
GAACGTGATA TGAACCAGAT CAATGCGCAA GAGATTGAAA GCTTCACTAT CCTCAAAGAT
GCCTCTGCAA CTGCAGTTTA CGGCGTAAGG GGAGCTAACG GTGTAATTTT GTTGACTACC
AAACGTGGTT CAGTGGGTAA GCCGGCTGTC ACCTTCAGGT CAGAGGCGGC AGCACTTCAT
GCCATGCGTT TACCGGAGTA CATCAATGCC GGGCAATATG CTTCGTTAAT GAATGAGGCG
CGCATCAACT CCGGAAATTC ACCAACCTGG AGTGATGAGG AAATACGGAA ATTCAAAGAT
GGATCAGATC CTTACCTGTA TCCGAATTCC AACTGGACCG ATGCCGTATT AAAAAAAGAT
ACCTGGCAAA CCATTCAAAA CCTCAGCGTA ACAGGAGGTA GTGATGTCAT TAAATACTAT
ACCAACGTAG GTTTTACTTT GCAGGACGGA ATCTATAAAC AAGACAATAA CAATCCCTAC
AATACCAACG CAAACATAAA ACGGTATAAT TTTCGCAGTA ACGTGGACAT CAACCTCTCA
AAAAGTTTGT ATATGCAATT GGGAATTGGT GGTATCATCC ACAAAGGTAA TTATCCGGGA
TGGTCTGCGC CAGATATTTT CAATGCACTA AAAGTCATTT CGCCCATCGC TTATCCGGTA
ACCAACCCAG ACGGCACGCC TGGCGGCGCA GCAACTTATT TAGGATGGAA TCCTTGGGCC
AGGGCAACCC AGTCAGGCTA TACAACTCAG GACCGCCTCA ACCTGCAAGG TACTTTCGCC
ATGAAATGGG ACTTGTCCTC TTTTACAACG AAAGGACTTT CACTCAGGGC ATTGTTTGCC
TATGATCGTT ACACGCAAAC GGATAATCCC AGAAGAAAAG CTTTTCTTGT GAAACGTTAT
CTCGGTAAAG ATCCGATAAG CGGTAAAGAC CTTTACAGCA CTCCTTTTCA GGAAGAACAA
CCGCTTGGTT ATGGTGTGGG GGGATACAGC AACCGCGCCA TATATACCGA AGCGCAGATC
AATTACGAGC GATCGTTCGG TAAACATAGC GTAACTTCCA TGCTCCTGCT AAATGAGCGT
GATTACGTCG ATCTGTCTGC CGGTACTTCT GTTGCAAACC TGCCCTATCG CAGAAGAGGC
CTTTCCGGTA GAACCACCTA TAATTATGAC AATCGTTATT TGGTAGAATT CAATTTTGGT
TATAACGGTT CAGAAAACTT CCCTGACGGT AAGAAATATG GATTTTTTCC TGCTGCTTCT
GCAGGCTGGG TAGTCTCTAA TGAAAAGTTC TGGAAAGTGA ATTTTGTAAG TAACCTGAAA
ATAAGAGCTT CAAGAGGTTT AGTAGGAAAT GACAATATTT CCCAACGTTT CTTATTCCTG
AGCACAATCC GGACCAACGG ACAGTCTTAT CTTTTCGGAG CCGATCAGCA ATTGTTTAAC
GGAATGGAAG AAGAGGCGAT TGGAAATCCG AATGTAACCT GGGAAAGGGC CACCAAAAAC
AACATTGGTA TAGATCTGGG TTTATTTAAA GACAGAATTA CCCTGCAAAT TGATGCTTTT
AATGAAGATA GAAAAGACAT CTTACTTAGA AGAGGTACAG TACCTGATTT TGCAGGTTTC
TTTCCCTGGT CTATTCCTTA TGGCAACCTC GGCCGGATCA AAAATAAGGG TATTGATGGT
TTATTGGAAA TCAAAAATAC AACCAGCAAA GGCCTTTTTT ATTCTCTCAG AACAAACTTT
ACCTGGGCCA GAAATACCAT CGTCGAAAAC GATGAACCCA GCCGGAAATA TGCTTACTTA
TCTGGCAAAG GCCTCCCTTT ACTACAGCCC CTGGGTTTTG TAGCAGATGG CTTCTTCTCC
AGTCTGGACG AAATTGAAAC CAGCCCAAGG CAAACATTTT CCAGGCCAAG GGTAGGAGAT
GTTAAATATA AAGACATTGA TGGTGATGGA CTAATTGATG CAAACGATCG GATCCCAATC
GGCTATGCGC GGTTGCCGCA AATGACCTTT GGCTTTGGTG GTACTGTGGC TTACAAAGGC
TTCGATGCCA GTGTATATTT TACAGGCGCA GCACAAACAA GTCTGATGTT GAGCGGAACT
TCCATGTGGC CATTCTTTGA TGGCGCAGGG GTAAACAATG TGCTGACCGA GTATTACGAC
AATCGTTGGA CTCCAGAAAA CAGGAACAAC GCACTATATC CTGCAATTGA TGAGGGTAAC
AACCCAAACA ATTTTGTCAA TTCAACCTTG TACATGCGCA ATGGCGATTA CCTGAGGTTA
CGTAATGCGG AGATCGGTTA CTCCTTACCA AAACGCTTCA ACAATAAAAT TGGCGTATCA
AATATGAGGT TATTCATAAA CGCAGTCAAT TTATATACCT GGGACCACAT TAAGATCATC
GATCCGGAAT CAAATGATGG TACTGGTGGC TATCCTTTGC AGCGATCTTT TAACGCCGGA
CTACAAATTG ACTTCAAATA A
 
Protein sequence
MITEAKAEDF FYQEARAKLI TGKVIDENKL PLIGVSVKIK GGTGGAITNA EGDFTLSVKS 
EKDSIQFSYI GYKTQTISAG VTGFVTIQMT PRSKENLDEV AIVGYGTQKK VSVTGAISTI
SVSEMQKVST PSLSNAIAGK LPGIITRQAT GEPGYDAAAI YIRGLSTFGQ NAPLVLIDGV
ERDMNQINAQ EIESFTILKD ASATAVYGVR GANGVILLTT KRGSVGKPAV TFRSEAAALH
AMRLPEYINA GQYASLMNEA RINSGNSPTW SDEEIRKFKD GSDPYLYPNS NWTDAVLKKD
TWQTIQNLSV TGGSDVIKYY TNVGFTLQDG IYKQDNNNPY NTNANIKRYN FRSNVDINLS
KSLYMQLGIG GIIHKGNYPG WSAPDIFNAL KVISPIAYPV TNPDGTPGGA ATYLGWNPWA
RATQSGYTTQ DRLNLQGTFA MKWDLSSFTT KGLSLRALFA YDRYTQTDNP RRKAFLVKRY
LGKDPISGKD LYSTPFQEEQ PLGYGVGGYS NRAIYTEAQI NYERSFGKHS VTSMLLLNER
DYVDLSAGTS VANLPYRRRG LSGRTTYNYD NRYLVEFNFG YNGSENFPDG KKYGFFPAAS
AGWVVSNEKF WKVNFVSNLK IRASRGLVGN DNISQRFLFL STIRTNGQSY LFGADQQLFN
GMEEEAIGNP NVTWERATKN NIGIDLGLFK DRITLQIDAF NEDRKDILLR RGTVPDFAGF
FPWSIPYGNL GRIKNKGIDG LLEIKNTTSK GLFYSLRTNF TWARNTIVEN DEPSRKYAYL
SGKGLPLLQP LGFVADGFFS SLDEIETSPR QTFSRPRVGD VKYKDIDGDG LIDANDRIPI
GYARLPQMTF GFGGTVAYKG FDASVYFTGA AQTSLMLSGT SMWPFFDGAG VNNVLTEYYD
NRWTPENRNN ALYPAIDEGN NPNNFVNSTL YMRNGDYLRL RNAEIGYSLP KRFNNKIGVS
NMRLFINAVN LYTWDHIKII DPESNDGTGG YPLQRSFNAG LQIDFK
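
The relationship between the gene sequence and the protein sequence above can be spot-checked by translating a few codons. The following is a minimal sketch, not the pipeline that produced this record: it builds the codon table by hand, relying on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the two differ in which start codons are permitted, which this sketch ignores). The helper names `clean` and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(blocks: str) -> str:
    """Join the space-separated 10-character blocks into one uppercase string."""
    return "".join(blocks.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks and final line of the gene sequence, copied from the record.
FIRST_BLOCKS = "ATGATTACTG AGGCAAAAGC AGAAGACTTC"
LAST_BLOCKS = "CTACAAATTG ACTTCAAATA A"

if __name__ == "__main__":
    print(translate(FIRST_BLOCKS))  # MITEAKAEDF
    print(translate(LAST_BLOCKS))   # LQIDFK
```

The two outputs match the first ten and last six residues of the listed protein sequence. The final line can be translated on its own because the 3000 bases preceding it are a whole number of codons.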