Gene Phep_0445 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_0445 
Symbol 
ID8251530 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp526674 
End bp529805 
Gene Length3132 bp 
Protein Length1043 aa 
Translation table11 
GC content42% 
IMG OID644934093 
ProductTonB-dependent receptor plug 
Protein accessionYP_003090731 
Protein GI255530359 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAAG TTTTACAAAG AATACTAATG GGCATATTGG TGTTACTGTC AGCTTCAACA 
TCAATTTATG CGCAGACTAT CAAGATCTCA GGAACTGTAA CTGATGATAA AGGAGAAGTG
ATACCAGGGG TAAGTGTACA GGTAAAAGGT ACAAAAACGG CTGTCCAGAC CGGACAGGAC
GGTAAATATT CGATTGCGAT AAATGATGCG CAGGCGGTAT TGATTTTTAG TTATATCGGA
TTCGATAAGA AAGAAGAAAC AGTTGGTTCC AGACGTACGA TCAACACTAA ACTGACATCC
TCATTATCAG ATCTGGACGA GATAATTGTA GTTGGCTACG GGCAACAGAA AAAAAGGGAT
GTAACCGGGG CAATCAGTTC TATCAGTGCA AAAACTATAG AGGAAAAACA ACCCATATCT
ATTTTTGATG CCATACAGGG AGCGGCTCCG GGTGTAAGGG TAATGAGTAG CTCTGGTGCG
CCGGGAGACG AAAGTGACAT TACCATTCGG GGTATGTCGA CGCTATCGGA TGATGGTGTT
AAACCTTTGT ACATTGTGGA TGGGGTTCCT ATGAAAAATA TTATGTCTAT CAATCCTAAA
GATATATTAT CCATCGAAAT CCTTAAAGAT GCCGCATCGG CAGCCATTTA TGGTTCCCGT
TCGGCAAATG GTGTAATCCT GATTACGACT AAACAGGGCG AAGATGGTAA ACCAAGAATC
AATGTTGATT ACCTGAGAAG TTACAGTACG CTGTCGCGTC ACATTCCGCA GTCCAACCGT
TTGCAACGAC AAATGTTTGA TCAGCGAGGC AAAATCAGTC TGGACCCTCA CCCGGATGAT
TCAACAGCTT ACAATAAAAA TGCAGATAAT GATTACCAAT CGTTAATTAC ACAGACCGCC
ATACGCAATC AGTTTGATTT GTCTATGAGC GGTGGAACAC AGAAACTGAA TTACTACAAT
AGCTTGCAGT ATCTTGATGA GGAGGGGATT GTATTGGCTA GCTTTAACAA AAGAGCCACA
ATGCGTACCA ACCTGAAATA TCAGCCCTCT AAAAAGGTAG GCTTGTCTAC ACAGCTCAGT
TTTAGTTACC AGAATAAAAA CAACATTAAC GAGGGTAAGG TCATTCAGCA GGCCCTTCAA
CGTGGCCCGC AGCAAACCCT GTATTTGCCC AGCGGAGAGT TGATTTATGA CAATGGGGGA
AGGAAAAATC CGATAGCAGA AGCTTATCTC AGAGAGAACC TGACTTCCAT CTACAGGGCG
GTGATTTTTC AGGGCTTTGA CTATTCCTTT ACGAACGACT TAAAGTTCCA TGCAGATGCA
TCGGCGGACG TACAGTTTAA CAGAAATACC ACGTTCAACT CCAAATTATT GGAAAACGGG
GCCAGCGCAG CCAGTACGGG TAAAGACGAA ACAACTATAC CAGTCAGGAC CCAGGGAAAT
GCGGTGCTGA ACTATAAAAA AACAATTGCC GTAAATCACA ACCTTACGGC AATGCTGGGG
GCCAACTTTG AAAAAAATAA CCTGAGTGAA ATACAAATTA CAGGCAGAAA ATTTGTAAGT
GAAAACGTAC ATACCTTAAA CGCCGCAGGC GAGTTGACGC CATCTGATAC TTATTCCAAA
GGCAGCGGAT CGGCACTGGT AGGTTTCTTT ACCAGGGTCG GTTATGACTA TAAAGGACGT
TACCTGTTGA ATGCAACGTT AAGAAGGGAC GGGTCATCTG TATTTGGTAC AGAAAATCAA
TGGGGCTATT TCCCTGCAGG ATCAATTGGA TGGAGGTTTA GCGATGAGAA CTTCATGAAA
TGGTCTAAAA AGATATTAAC TGATGCCAAA CTGAGAGGTA GCTGGGGTAT TACCGGTAAC
CAGCAGATTG GTGATTTTGA CGCTGTTCAG CAATTTATTT TTGGTAACTA TTATTACAAT
AATGTTAGCG GGGTGCGTAC CGATACCAGG TTGGGGAACA GTAGCCTGAA ATGGGAAGAG
ACTACCCAGT CAAACGTGGG TATGGATTTA ACCTTCTTTG ACGGTAAATT GAGTTTTGTG
GGTGATTACT ACATCAAAAA AACAAAAGAT CTATTATATG ACGCACCACT TCCTTTAGAA
TCCGGCTTCC CTAACAAAGC CAGGATCAAT GCGGGTTCGC TTCAAAACAA AGGTATCGAA
CTAATGGTTT CAGGATATCC TGTCCAAACC AAAGAATTTA CCTGGCAAAC ATCCATCAAC
TGGTCAAGGG TAAGGAACAA AATCCTTAGC CTGCCGGGTG GTGATTATGT AGATGACAAC
TGGATTGTTG CACAAGGTAA AGAAGCCGGG AACTTCTTTG GTTACAGATT CCTGGGCATT
TATCAGTACG ATCAGTCCAA TGCCTATACA GATGACTTTA AAGTCAGACT GATCCCACAA
TTTCAGAAAG ATGAATTGGG TAATGTGGTC ATCGGGAAAA ATATGCAGCC AAATTTATTA
GGTTATACTT ATCCTGACGG GACACCTTAC AGCGGAACGC CAAAACAATT GACTACCAAT
GGTAACATTT CTAAGGGTGG TGATGTGATC TGGCAAAATC AGCCCGATGC CAATGGGGTT
TTAAATGGCA ACATTGGTAA CGAAGATAAG ATTGTAACCG GATACGGACA GCCAAGATGG
AGCTTGGGTT GGAGCAATAA CTTTACTTAC AAGAACTTCT CTCTGGCAGT TTCTCTGTAC
GGAAATTTTG GAAACAGTAT CTATAATGAA AACAGAAGAA ATACTGCTTC TTTTTCCAAT
AGCAATACGA CTCCCGAGCC TTATTATATC CTGAACATGT ACAAATATCC GGGACAAATT
ACCGAGTCAT ACATTGGTGG GACCCAAGGT TCTGATAACG TGCGTAAAAC CAACGATTAC
TATCTGGAGG ATGGTTCCTT TATACGTTTG CAGAATGTCA GACTGGGATA TCAGCTGCCA
AGAAAACTTA TTGAGCGGTT CCATATTGCT AACTTTAACC TGTATGTTTA TGGTAACAAT
CTGTTGACCT GGACAAAATA CACAGGTTTT GACCCGGAAG TGAAACAAAA TAGTGTGCTT
AAACCTGGGC TTGATAACGG ACAATATCCG CGCAAAAGAG AGATGGGATT AGGACTTGGT
ATAACATTTT AA
 
Protein sequence
MKKVLQRILM GILVLLSAST SIYAQTIKIS GTVTDDKGEV IPGVSVQVKG TKTAVQTGQD 
GKYSIAINDA QAVLIFSYIG FDKKEETVGS RRTINTKLTS SLSDLDEIIV VGYGQQKKRD
VTGAISSISA KTIEEKQPIS IFDAIQGAAP GVRVMSSSGA PGDESDITIR GMSTLSDDGV
KPLYIVDGVP MKNIMSINPK DILSIEILKD AASAAIYGSR SANGVILITT KQGEDGKPRI
NVDYLRSYST LSRHIPQSNR LQRQMFDQRG KISLDPHPDD STAYNKNADN DYQSLITQTA
IRNQFDLSMS GGTQKLNYYN SLQYLDEEGI VLASFNKRAT MRTNLKYQPS KKVGLSTQLS
FSYQNKNNIN EGKVIQQALQ RGPQQTLYLP SGELIYDNGG RKNPIAEAYL RENLTSIYRA
VIFQGFDYSF TNDLKFHADA SADVQFNRNT TFNSKLLENG ASAASTGKDE TTIPVRTQGN
AVLNYKKTIA VNHNLTAMLG ANFEKNNLSE IQITGRKFVS ENVHTLNAAG ELTPSDTYSK
GSGSALVGFF TRVGYDYKGR YLLNATLRRD GSSVFGTENQ WGYFPAGSIG WRFSDENFMK
WSKKILTDAK LRGSWGITGN QQIGDFDAVQ QFIFGNYYYN NVSGVRTDTR LGNSSLKWEE
TTQSNVGMDL TFFDGKLSFV GDYYIKKTKD LLYDAPLPLE SGFPNKARIN AGSLQNKGIE
LMVSGYPVQT KEFTWQTSIN WSRVRNKILS LPGGDYVDDN WIVAQGKEAG NFFGYRFLGI
YQYDQSNAYT DDFKVRLIPQ FQKDELGNVV IGKNMQPNLL GYTYPDGTPY SGTPKQLTTN
GNISKGGDVI WQNQPDANGV LNGNIGNEDK IVTGYGQPRW SLGWSNNFTY KNFSLAVSLY
GNFGNSIYNE NRRNTASFSN SNTTPEPYYI LNMYKYPGQI TESYIGGTQG SDNVRKTNDY
YLEDGSFIRL QNVRLGYQLP RKLIERFHIA NFNLYVYGNN LLTWTKYTGF DPEVKQNSVL
KPGLDNGQYP RKREMGLGLG ITF