Gene Phep_1121 details


Gene Information

Locus tag: Phep_1121
Symbol:
ID: 8252215
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pedobacter heparinus DSM 2366
Kingdom: Bacteria
Replicon accession: NC_013061
Strand:
Start bp: 1306052
End bp: 1309489
Gene Length: 3438 bp
Protein Length: 1145 aa
Translation table: 11
GC content: 44%
IMG OID: 644934772
Product: TonB-dependent receptor plug
Protein accession: YP_003091401
Protein GI: 255531029
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1629] Outer membrane receptor proteins, mostly Fe transport
TIGRFAM ID:
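
The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the Start bp and End bp values are 1-based and inclusive, and that the gene length includes the stop codon; both assumptions fit the numbers in this record but are not stated on the page. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa of the product, assuming an unspliced CDS whose
    length includes one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record for Phep_1121
length = gene_length(1306052, 1309489)
print(length)                  # 3438
print(protein_length(length))  # 1145
```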


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000289736
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAAATTAA TAACGTTATT GATAACCATA GCTTTTATTC ATATCAGTAT CAAAGGATAC 
GGTCAGATTA CGCTGAACCA GAAAAATGCA GCACTGGGAA AAATATTTCA GACGATTGAA
AAGCAAACGG GTTATGTGTT TTTGTATACA GATAAAGAGC TGACCAGGAT AAAGATTGAT
ATACAGGTAA AAAATGCGCC GATTGAGGTA ACGCTGAAGG AATGCTTCAA AAACCTGCCT
TATACCTTTA AGATCGTTGA AAACAATATC CTGCTTAAAA AAGAGGATGA CATTGTCGTA
TTGCCTGAAA CAGCTGCTGC CGGATCGGAG ATCCAATTAC AGGGAACGGT GAGTGATGAT
AAAGGTGCGG CCCTGCCTGG CGCAACGATA AGGGTGGAGC ACAGCAGCAT TGGCACAATT
GCCGGTTTAG ACGGTACATT TACCCTCCAG GTTCCGGATG CGAAAGTGAT GCTGATTTTT
TCGTATGTGG GTTTTGTGAC AAAAAAAGTA AGACTTAACG GACTGGCCAG AATGGAGGTG
AGCCTGACGG AGGATGCGGC TGCTTTTGCA GAAATCCAGG TGGTGGGTTA TGGTATTCAG
AAAAGAGCCA ATGTGGTAGG AGCCATTTCC AATATTAACA TGAAGGAACT TAGAAAAGCA
GCACCATCAA ATTTGAGCAA TGCGCTCGGG GGCCGTGTTC CTGGTATAAT TAGCAGAATG
GGAGATGGAA CACCAGGGGG TGTGCAGAAT AGATTCTCGA ATGGAAACGC TGATGATGCC
CAAATTTATT TACGCGGAAG AGCCAGTATG AACAATACAA GTGCCCTGGT CCTGATCGAT
GGTGTGGAAG GTTCCTTGTC CAGAATAAAT CCGGAAGATA TTGAGCAGTT CAGTGTGCTG
AAAGATGCAT CTGCAACAGC AGTTTACGGT GTACGAGGGG CTAATGGCGT GATCCTGATC
ACGACCAGAA AAGGTAGCAT AGGGGCACCA AAGATCGGTA TAACCAGCCA GATCAGGATG
CAAAAGGTAT TGGATTTTCC GAATTTTCTA AGATCCTATG ATTTCGCAAT GTTAAATAAC
GAGGCTCGTA AAAATCAGGG TCTTCCTGAA ATTTATTCAG CGGAAGATCT GGAGCATTAC
CGTACCGGTG ATGATCCTTA TGGCTGGCCG GATGTTGACT GGAAAGAAGT ATTGCTGAAA
GACCAGTTTT ATGAACAACA GTACGTTGGA AATGTTTACG GTGGGACAGA ACGGGTATCC
TATTATTTAT CCGGTGAGTA TAACCAGTCG GGTGGCGCGT TCATTGAAAA TAAAGAGAAG
AATACACAGT ATAGGTACAG ACGTTATAAT CTTAGGACCA ACTTTGATTT TAAGATCACT
AAAACTACTG ATCTGGGTGT GAAATTGAAT GGCAGGTTGA ATGACCTTCA TTACCCACTA
AAAGGTGAAA GTAGCGGACA GCGGGTAACT GGTCCCGGAT GGAGCGATAT TACGGCAAGA
GCCCCGCTCA CTGCCCCGGT ATACAATCCT AACGGTACTT ATGCAAATGG TGGTTCGGAC
CTGCCGGGTA ACCCTGTGGC TGAGTATATG GAGGGCGGTT TTGCCCAGCG GCTGCAAAGC
GGTCTGGAAT CCAACTTTAC GCTGAACCAG AAGTTGGATT TTGTAACACC CGGGCTTTCA
TTCAGGGGCT TATTTGCTGC GAACTTTGGA TCGGGCAGTG CAAAAGCATT AAATTCAAGG
AGTGCTGAGA TCTGGGCGTA TGATAAAATT ACCAAGACCT ATACATTAAT GTTTGGGGCT
GGAATTCCAA CCTATACACT GGGTAGTAAT TTCAGTGATT TCAACCGTAT ACAACAGGTT
GAAGCTGCCT TGAACTATGA TAAGGCGATT GGCATGAACC ACAAAATAAC CGCCATGGGC
ATTGCTACCC AGACCACCAA AGAGGCTTCG TTCATTGTTC CTACTATTTT CAAAGGAATG
GCAGGCAGGC TTACCTATGC TTATAAGGAT AAATACCTTG CTGAAGGAAA TGTAGGCTAT
AATGGTTCTG ATGCATTTAG TAAATCGAAA CGTTATGCAT TTTTTCCTTC CGGTGCCCTT
GGATGGGTAG CTTCAGAAGA AAGTTTTATA AAGGATAATG TTAAGTTTCT GGATTTTCTG
AAGTTCAGGG GCTCCTATGG TGAAGTAGGG AATGACAGGC TCGGTTTCGG ATACAGCAAC
TTGTATATCT ATTCATTTAG AAACCCGCTC GCGGCTGAGA CTCCCGGTAC CTCTACTACA
GTTAACGGCT ATTATAGCCT GGGAACTACA CCTAATCAAA TCCTTCCGAT CTTAGAAGGA
ACGTTGGGAA ATCCAAACGT GACCTGGGAG GTTGCCCGCA AGGCAGATAT CGGGGTGGAG
GCAAAATTGT TTAAAAGCCG TCTCAGCTTT GAAGCAGATG TATTTCTGGA AAAGCGTAAT
GATATTCTGA TCAACAGATT CGATATACCG TTGATTTCTG GTTTAGTACC AGCAAAGCTT
CCTGCATTGA ATGCCGGAAA GGCAACAAAC AAAGGATATG AATTATCGCT TAGTTATTCT
GACAATATTG GTGGCTTTGG TTTTACAGTA GGGGGTAATT ATACTTTTGT CCGCAATACC
ATCGATTACA TGGCTGAAAC GCCGAAGAAA TACCCATGGC AGGAACAGAC GGGCAAACAA
ATTGGCATGC TTGCACCTCA ATTTATCTGG ACGGGTAAAT TTTACAGTGA AGAGGATTTG
ACCAATAATG CTGTTCCCAA ACCGGTTGCA AAAGTTTGGG CCGGCGAACT GATGTTTAAA
GATCTGAATG GCGATGGTAA AATTGACTCA GATGATAAGG CATATACTGG TTATGGTCAG
ATTCCGGAGA AGATATTTGG CATTAACCTG AATATGGATT ACAAGAATTT TTATTTGAAT
ACGTTCTGGC AAGGTGCATC CAACGTGGTC ATTAACCCTA CTGCCGGGAT GCGGCTTGAA
TATGCTGGTT ATGGATACAA TGTTCAGGAG TTCCATAAAG AAGATCGTTG GGTATATGAT
CCTTCCCGCG GATTGGATAC GCGGGCAACG GCAAAATATC CCCTATTGAT GCTCGGAGGC
GCACCGCAAA CCAGGGAGCT TTCTACCTTT CATGTGCTGA ATGGCGAGTA TTTACGTTTG
AAAGCAGCTG AATTTGGGTA TACTTTTCCT AAAACCCTAA TCACAAAGCT TCATATAGCA
GACCTGAGAG TGTTTGTAAG TGGTTCAAAT CTGCTGACTT TCTCTCATTT AAAGAGATAT
CACATCGATC CTGAATATCT TGGAAACAAT ATACCGGGTC AGATGGTTGC TGGTCAGGGT
GAGGCCAATG GACTTGGATC CGGAGCTTGG TCGCCCCAAA ATAAATTTTA TGCCTTTGGG
CTTAACGTTA CTTTTTAA
 
Protein sequence
MKLITLLITI AFIHISIKGY GQITLNQKNA ALGKIFQTIE KQTGYVFLYT DKELTRIKID 
IQVKNAPIEV TLKECFKNLP YTFKIVENNI LLKKEDDIVV LPETAAAGSE IQLQGTVSDD
KGAALPGATI RVEHSSIGTI AGLDGTFTLQ VPDAKVMLIF SYVGFVTKKV RLNGLARMEV
SLTEDAAAFA EIQVVGYGIQ KRANVVGAIS NINMKELRKA APSNLSNALG GRVPGIISRM
GDGTPGGVQN RFSNGNADDA QIYLRGRASM NNTSALVLID GVEGSLSRIN PEDIEQFSVL
KDASATAVYG VRGANGVILI TTRKGSIGAP KIGITSQIRM QKVLDFPNFL RSYDFAMLNN
EARKNQGLPE IYSAEDLEHY RTGDDPYGWP DVDWKEVLLK DQFYEQQYVG NVYGGTERVS
YYLSGEYNQS GGAFIENKEK NTQYRYRRYN LRTNFDFKIT KTTDLGVKLN GRLNDLHYPL
KGESSGQRVT GPGWSDITAR APLTAPVYNP NGTYANGGSD LPGNPVAEYM EGGFAQRLQS
GLESNFTLNQ KLDFVTPGLS FRGLFAANFG SGSAKALNSR SAEIWAYDKI TKTYTLMFGA
GIPTYTLGSN FSDFNRIQQV EAALNYDKAI GMNHKITAMG IATQTTKEAS FIVPTIFKGM
AGRLTYAYKD KYLAEGNVGY NGSDAFSKSK RYAFFPSGAL GWVASEESFI KDNVKFLDFL
KFRGSYGEVG NDRLGFGYSN LYIYSFRNPL AAETPGTSTT VNGYYSLGTT PNQILPILEG
TLGNPNVTWE VARKADIGVE AKLFKSRLSF EADVFLEKRN DILINRFDIP LISGLVPAKL
PALNAGKATN KGYELSLSYS DNIGGFGFTV GGNYTFVRNT IDYMAETPKK YPWQEQTGKQ
IGMLAPQFIW TGKFYSEEDL TNNAVPKPVA KVWAGELMFK DLNGDGKIDS DDKAYTGYGQ
IPEKIFGINL NMDYKNFYLN TFWQGASNVV INPTAGMRLE YAGYGYNVQE FHKEDRWVYD
PSRGLDTRAT AKYPLLMLGG APQTRELSTF HVLNGEYLRL KAAEFGYTFP KTLITKLHIA
DLRVFVSGSN LLTFSHLKRY HIDPEYLGNN IPGQMVAGQG EANGLGSGAW SPQNKFYAFG
LNVTF
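
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below is a minimal illustration and not the tool used to build this record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ only in which codons may act as start codons). The function names `clean`, `translate`, and `gc_content` are hypothetical, and the sequence is assumed to be given on the coding strand in frame.

```python
from itertools import product

# Codon table in the conventional TCAG order; table 11 shares these
# codon-to-amino-acid assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First and last lines of the gene sequence listed above
first_line = "ATGAAATTAA TAACGTTATT GATAACCATA GCTTTTATTC ATATCAGTAT CAAAGGATAC"
last_line = "CTTAACGTTA CTTTTTAA"
print(translate(first_line))  # MKLITLLITIAFIHISIKGY
print(translate(last_line))   # LNVTF
```

Passing the full gene sequence to `translate` and `gc_content` in the same way would allow the listed protein sequence and GC content to be checked against the nucleotide data.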