Gene Phep_1153 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_1153 
Symbol 
ID8252247 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp1351677 
End bp1354868 
Gene Length3192 bp 
Protein Length1063 aa 
Translation table11 
GC content41% 
IMG OID644934804 
ProductTonB-dependent receptor 
Protein accessionYP_003091433 
Protein GI255531061 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.192441 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAGAA TTTTACTATT ATTTAGTATG CTCATCCTAT GTTTCGGATT AGCAGATGCC 
CAGCAAACCA GGCAAATAAC GGGCAAGGTA ACAGACAAAA CGAGCAAACA ATCTATTCCC
GGAGTATCTG TAACTGTAAA AGGTGCCGGC AATGTGGTCA GCACAGATGA AAAGGGACAA
TTTAAAATAA ATGTTCCTGC AACCGGAAAC ATTGTTTTGA TTGCCAGATA CGTAGGTTAT
AAACCTCAGG AAATTACGGT AGGAAATGAG TCCAAAATTA ACTTCGTACT CGAAGAAAAC
ATTTCCTCAT TGGATGAAGT GGTGATCAAT ATTGGTTACG GTACAGTACG TAAAAAAGAC
CTGACGGGTG CTGTTTCTTC TGTAGGTGCA GATGTAATTG CTGCTGCACC GGTTTCTTCA
GCTTTAGAAG CTATTCAAGG GCGTGTAGCG GGGGTAAACA TTACTTCAAC AGAAGGCTCG
CCTGATGCCG AAATGGTGGT AAGGGTAAGG GGCGGTGGCT CAATTACACA GAGTAATTCT
CCGCTTTACA TTGTAGATGG ATTCCCGGTA GCCTCTATAT CTGACATCGC ACCGCAGGAT
ATCGAATCGG TTGATATCTT AAAAGATGCT TCTTCCACAG CTATTTATGG TTCAAGGGGT
GCCAATGGGG TGGTACTGGT TACCACAAAA GGTGGTAAAA GTGGTAAAAC GAATATCAGC
TATAATGTAT TTACCGGTAC CAGAAAACTA GCCAATAAAC TGGATGTTTT AACACCTTTT
GATTATGCAT CCTGGCAGTA CGAAAGAGCA CTTTTGAGCA ATGCACTCGA TGTGTATACC
AAATATCTGG GCAACTATCA GGATATAGAC CTTTACAGAA ATACACCGGC CAACGATTGG
CAGGAGCTGG TTTTTGGCAG GACAGGTACT ACTTTTAACC AGAACCTGAA CATCAGCGGA
GGGGGTGAAA AAACAAAATA TAGCCTTAGC CATAGTTTTG TAAAAGACAA ATCCATTATG
CAGCTCTCCA GTTATGAACG GCAAAACGTT AACTTTAAAA TGAACCATAA ACTGTACAGT
AAGCTTACAC TGGATGTTGG ACTACGTTAT GCAGATACTA AAACAAAAGG CGGGGGCGTA
AATGAACAGA ATGAGGTATC TTCTGCAGAT TCGCGTTTAA AAAACGCCAT GATCTATCCG
CCATTTCCAA TACAGGGGCT TACTACAACT ACTGAAACGG ATGATACATT TAATTTGTAT
GATCCATTGG TATCTATTTC TGATAATGAC CAATACATCC ACCGTAAAAC CTATAATTTA
AGTGCTGCAC TTACCTACGA AATCATAGAT AACCTTAAGT TGCGCTCAGA AGTCGGTTAT
GACGGTTACA GAAATGACCA GGACCGGTTT TATGGTACAA CAACCTATTA TGTAAGAAAT
GTACCCACCA GCGAAAACCA GAACCTTCCA GCAATTATTT TTTCCAATAC CAACAGGAAC
AGCCTGAGAA ATACAAATAC GCTGAGCTAT AGCTTCAATA AACTGTTGAA AAAAGACCAC
AATCTTACCG TATTGGCCGG ACAGGAATTC ATTAAAATTG AGGAAGGTAC AATGACCAAT
GTAGTACATG GCTTCCCTAA AACATTTACT TTTCAGAACG CCAGGGTTTT GTCTACCCAG
GGTAAGGCAA ACTCCATAGA TAATAATTTT TCCCCTGATG ATAAGCTTTT GTCGTTTTTT
GGCAGGGCCA ATTATGATTA TCTGGGCCGA TATTTGTTAA GCGCGACATT CAGGGCCGAC
GGCTCTTCGA AATTTTCAAC TGAAAACCGC TGGGGATATT TCCCTTCTGT TTCTGCAGCC
TGGAGAATTT CAGAAGAGAG CTTTATGGAG GATACTAAAT CATGGCTGGC AGACCTGAAA
TTAAGGGCCA GCTATGGTAC TGCAGGGAAT AACAATATTC CACCCGGACA AATCGCTCAA
ACGTTTCAAA ATTCTACCAC CACCTGGGTA AATGGGTTTA ACAGCTATTG GGCACCATCC
AAAATTATGG CCAATCCGGA TTTGAAATGG GAAACCACTG TTACAAGGAA TATTGGTGTC
GATTTTGCCC TGTTTAATTC TAAAGTTACA GGTACAATTG ATGCCTATTT GAACCGCACA
AACGACCTTT TAATTCAATT CCCGGTTTCA GGAACTGGTT ATGATTTTCA GTTCCGGAAC
ATTGGAAAAA CCCAGAATAA AGGTTTAGAG TTCTCGGCAA GCTGGAACGC GGTAAAGAAA
TCCAATTTTG ATCTGTCTGT GAATGCAAAT ATAAGTTTCA ATAGAAATAA GGTAATTTCA
TTAGGTACAG TAAAAAACAT TAACGGTACA TCCGGATGGG CCTCAACAGA AATTGGTGTG
GATTATCTGG TAGAAGAAGG AGCCTCGATT GGCAGAATTT ACGGCTACCA GAGTGATGGC
AGGTATGAAG TTTCAGATTT TACGGGATAC AATGCTGCCA CAGGGAACTG GACATTAAAA
GATGGTGTAG CCGATGCAAG TACGGTTATT AGTACGCTGC GTCCGGGATC TATGAAACTT
AAAAATCTTA CGGGCGACCT GAAGATTGAT CAGAACGATA AGACTGTAAT TGGGAATGCC
AATCCCCTGC ATACAGGTGG CTTTTCTATC AATTCCAGAA TCTATAGTTT TGATATAGGT
GCTTTCTTTA ACTGGAGTTA TGGCAACGAT ATCTATAATG CAAATAAAAT AGAATACACT
TCTACCAGTA AGTACAATTC AAGAAACATG ATCTCAACTA TGGCAACGGG CAATCGCTGG
ACAAACCTGC GGGCAGATGG GACGATCAGT AACGATCCGG CCGAATTGAC TGCTATGAAC
GCGAATACCA CGCTCTGGTC GCCCTATATG AGGTCTTTTG TGTTGAGCGA CTGGGCCATA
GAAGATGGAT CTTTTTTAAG GTTGTCTACC ATAACACTGG GTTATACCCT ACCCAGAAAT
TTATCAGCTA AACTAAAAAT GCAAAAATTA AGATTATATG CATCAGCTTA TAATTTATGG
CTGTTAACCG ATTATTCAGG GTTTGACCCC GAAGTTTCGA CAAGGCGCAA AACAGGGTTA
ACGCCGGGAG TTGATTATTC AGCTTATCCA AGAAGCAGAT CTTTTGTTTT TGGATTGAAT
GTTAATTTTT AG
 
Protein sequence
MKRILLLFSM LILCFGLADA QQTRQITGKV TDKTSKQSIP GVSVTVKGAG NVVSTDEKGQ 
FKINVPATGN IVLIARYVGY KPQEITVGNE SKINFVLEEN ISSLDEVVIN IGYGTVRKKD
LTGAVSSVGA DVIAAAPVSS ALEAIQGRVA GVNITSTEGS PDAEMVVRVR GGGSITQSNS
PLYIVDGFPV ASISDIAPQD IESVDILKDA SSTAIYGSRG ANGVVLVTTK GGKSGKTNIS
YNVFTGTRKL ANKLDVLTPF DYASWQYERA LLSNALDVYT KYLGNYQDID LYRNTPANDW
QELVFGRTGT TFNQNLNISG GGEKTKYSLS HSFVKDKSIM QLSSYERQNV NFKMNHKLYS
KLTLDVGLRY ADTKTKGGGV NEQNEVSSAD SRLKNAMIYP PFPIQGLTTT TETDDTFNLY
DPLVSISDND QYIHRKTYNL SAALTYEIID NLKLRSEVGY DGYRNDQDRF YGTTTYYVRN
VPTSENQNLP AIIFSNTNRN SLRNTNTLSY SFNKLLKKDH NLTVLAGQEF IKIEEGTMTN
VVHGFPKTFT FQNARVLSTQ GKANSIDNNF SPDDKLLSFF GRANYDYLGR YLLSATFRAD
GSSKFSTENR WGYFPSVSAA WRISEESFME DTKSWLADLK LRASYGTAGN NNIPPGQIAQ
TFQNSTTTWV NGFNSYWAPS KIMANPDLKW ETTVTRNIGV DFALFNSKVT GTIDAYLNRT
NDLLIQFPVS GTGYDFQFRN IGKTQNKGLE FSASWNAVKK SNFDLSVNAN ISFNRNKVIS
LGTVKNINGT SGWASTEIGV DYLVEEGASI GRIYGYQSDG RYEVSDFTGY NAATGNWTLK
DGVADASTVI STLRPGSMKL KNLTGDLKID QNDKTVIGNA NPLHTGGFSI NSRIYSFDIG
AFFNWSYGND IYNANKIEYT STSKYNSRNM ISTMATGNRW TNLRADGTIS NDPAELTAMN
ANTTLWSPYM RSFVLSDWAI EDGSFLRLST ITLGYTLPRN LSAKLKMQKL RLYASAYNLW
LLTDYSGFDP EVSTRRKTGL TPGVDYSAYP RSRSFVFGLN VNF