Gene Phep_3367 details


Gene Information

Locus tag: Phep_3367
Symbol:
ID: 8254486
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pedobacter heparinus DSM 2366
Kingdom: Bacteria
Replicon accession: NC_013061
Strand:
Start bp: 3996069
End bp: 3999170
Gene Length: 3102 bp
Protein Length: 1033 aa
Translation table: 11
GC content: 42%
IMG OID: 644937019
Product: TonB-dependent receptor plug
Protein accession: YP_003093623
Protein GI: 255533251
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1629] Outer membrane receptor proteins, mostly Fe transport
TIGRFAM ID:
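
The coordinates and lengths listed above are related by simple arithmetic, which the following minimal sketch checks. It assumes that the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; neither convention is stated in the record itself. The variable names are illustrative only.

```python
# Values copied from the gene record above.
start_bp = 3996069
end_bp = 3999170

# Assumption: coordinates are inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 3102
print(protein_length)  # 1033
```

Under these assumptions the computed values agree with the reported gene length (3102 bp) and protein length (1033 aa).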


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value: 0.994528
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAGCAA GAGCGTACTT ATTACTGGTA TTTTTTCTTT TTCCAATGTG GGTCTGTGCA 
CAGCAGCAGA TTGTGGGAAA AGTTGTGGAT GTAAGGGGAG AACCTTTACC TGGAGTTAGT
GTCACTGCAA AAGCAGATGA CGGACAAAAA TTTAATGCAA TTACCAATTC AAATGGTGAT
TATGATTTAC GTGTTTCTGG TGGCACCAAA GAATTAACCT ATACGTATAT GGGTATGATG
CCAGTTACGG AGCTCATAAA GGGCCGTAAC ACCATAAACG TACAGCTGGC TGAAGACAGC
AAGGAACTGC AAAATGTAGT AGTAACAGCC CTAGGTATCA AACGTGAAAT AAAAGCCTTG
AGCTATAGCA GGCAAGGGGT AGATGTAAAC ACGATGACGG AAGCTAAAAG CCCGAACTTG
TTGAGTACCT TATCTGGAAA AATTGCAGGC TTACAGATTG TACCACCTGG TTTTAATACA
GGTTCGGCCA GGGTTGTTAT CAGGGGTAAC AGTTCACTGA CCGGAAACAA CCAGCCACTT
TTTGTGGTAG ACGGGATGCC GATAGACAAT ACGGCAGGGG ATGGCAACAT CGATTATGGA
AATAATGCTG CTGATATCAA TACTGAAGAT ATTGAAAATA TAGAAGTGCT TAAAGGTCCG
AATGCTTCGG CACTTTATGG CTCAAGGGCA GCTAATGGGG TAATTTTGAT TACTACAAAA
AAAGGCACTA CTAAATTTAA GGTGTCGCTT AATTCGAGTT TAATGATGCA GAGATTAACC
GAATTTCCAG AATACCAAAA TGCTTACGGA GTAGGGACTT CATTTTACAT TGACAATACA
CATCGCTTGC CTGAAGCCAT GGTTAATTAT CGCAGTTGGG GGTCGCCTAT GATGGGGCAG
CCTTATGTTG CTTTAAATGG TGAAATCAAA CCCTACCTGC CACAGCCGGA TAACATTCGG
GATTTTTATC AGTCTGCTTC ATTGCTGACC AATAATATCG CCGTAGAAGG TGGGAATACA
AGTAGTATTT ACAGGATTTC CTATACCAAT TATGCCGGAT CCAGTGTAGT GGATGGATTT
AACCTGAGCA ATAAGCAAAC CGGAGACGTA CGCCTGCAAA ATACTTTTAG TAAAAAGGTG
AGCCTGGACA GTAAAATCAG TTACGTAAGA GATGCGGTTG ACAACAGACA GTATTCGAAT
GCGAATGGCC GGAATCCAAC AAACCTGTAT ACGCATATGG GCAGGAGTAC TGATCTTGCC
GAACTCATGC CCTATAAAGA CCCACTAACC GGAATGGAAA TAGGGACGCA TCGTAATTTC
AGCAATCCTT TTTGGGTAAT TAATGAGAAC CCCAACCGGG ACGTTAAAGA CCGTATAATT
GCATCTTTGA ACCCTAAAGT GAATTTCACC AATTGGCTGG TTTTTAATGG ACGCTTAGGT
GCCGACGTGC TGTGGTGGGA TGGTTTTGAA TTTAATAATA TTGGTTCTAT TGTGGCCAGC
AACCCTGATG GTTTTATGCG TACCTTTAAT ACCAAGCAGC AAAATTTTAA CCTGGAAGGT
ACGCTTGTAG CCAATAAGAC CTTTAATAAA TTTTCGGTGA GTACCATGCT GGGTGCCAGT
AGTTTTAGCT CTTGGTTTGA AAGAAGGGAA CAAAGGATCA ATTCTTTATT GCAGCCTGGC
TTAATTAATT TATCCAACGC CAAAGAGTTT CCAACGGTTA CGCAACAGCA ACGTGATAAA
CGTTTAAACT CTGTTTTTGG TTCTGTTTCT TTAGGTTACA GAGGGTATGC TTTTGTTGAT
GTAACAGGTA GAAACGATTG GTCATCTACA TTGCCGAGGG CTAATAATTC TTACTTCTAT
CCCTCCGTTG GCGGCTCATT GATTGTGAAT GAAATATTAG GTTTGAAGAG CGACATCCTT
AGTTTTGCCA AATTGCGCGC ATCTTATGCC ATCGTAGGAA ATGATACCGA TCCGTATAGA
TTAGACCAGA CTTACTCATT TAATGGTTTT TTAAATGGGG CCACCCTGGC TTCACTGGCC
ACTACAATGA ACAACGCAGA TCTTAAGCCC GAAAGGACAA CGTCTTTTGA GTTAGGAATG
GATGTAAGGT TGTTTAAAAA TAGGGTTTCA ATCGATGGCA CCTATTACAA TGCTGCTACT
ACTAACCAGA TTGTAACAGC TCAGCTTCCA TCTTCAAGCG GTTATTTAAA GCGAATTTAT
AATGCCGGAA AAATAAAGAA CTGGGGTTAT GAACTGAGCG GAAATGCAAA GGTTATTGCC
GGGAAGAACT TTTCATGGAC AACCCAGCTT AATTATGCGG CGAACAATTC GAAGGTAGTA
GAGCTGATAG AGGGGATTGA TCGTTTCCAG CTGAATAACA ATTCGAGTTA CCTGTATGTA
TATGCTGAAG TGGGAAAACC ATATGCCTAT TTGCGGGGTT TGGGAGTGGC CCGCGATGCC
CAGGGCAGGA TGTTGATCGA GGATGGGGGA TCCTTATTGG TTAAGGATAA TGACATGGCC
TTTGGAACGG CTTCACCGGA TTGGATTGGT GGCATTTACA ATACTTTTAA GTTTAAAAAT
CTGGACCTCG GTTTCCTGGT AGATGTTAAA ATGGGTGGGG TAATGTATTC TGGTAGTATT
TCGCGAATGC TGACGAACGG TGTTTTAGCG GAAACCTTAT ACGGACGCGA TGATTATTAT
AAACATACCG TGATTTTTGG GGAGAACAAT ACAGAGTTAA GTGGTGGTGC AATATGGGAT
GCCTATTTTG CGGATGGGAC TAAAAATACG AAGTTCGTTA CCCCTCAGAA CTACGAATAT
GCAAGGCCGA ATTATGCAGA ATTTGTGATC TACGATGCGT CTTATGTAAA GCTAAGAGAA
GTTACGGTCG GTTATACATT ACCTGTTAAG CTGTTGTCGA AAATGCCGGT TAAAACGGCA
AGGTTCTCTT TATCCGGCAG GAACCTGGCT ATTCTTTATA GAAGAACCCC ACGTGGCCTG
GATCCTGAAG CAATGTCTAC CTCTGGTAAC GGACAGGGAA TTGAGAATGG TGCGTTGCCT
CCGAATGCAA TTTATGGATT GAATATCAGA CTTACTTTTT AA
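
The gene sequence above is displayed in blocks of ten bases, so it needs to be joined into one string before any analysis. The sketch below shows one way to do that and to compute a GC fraction that can be compared against the reported GC content. The function names `clean_sequence` and `gc_fraction` are hypothetical helpers, not part of any database tool.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(sequence: str) -> float:
    """Return the fraction of bases that are G or C (0.0 for an empty string)."""
    if not sequence:
        return 0.0
    gc_count = sum(1 for base in sequence if base in "GC")
    return gc_count / len(sequence)


# Usage with a toy input; paste the full gene sequence in place of this string.
example = clean_sequence("GGCC AATT\n")
print(example)               # GGCCAATT
print(gc_fraction(example))  # 0.5
```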
 
Protein sequence
MKARAYLLLV FFLFPMWVCA QQQIVGKVVD VRGEPLPGVS VTAKADDGQK FNAITNSNGD 
YDLRVSGGTK ELTYTYMGMM PVTELIKGRN TINVQLAEDS KELQNVVVTA LGIKREIKAL
SYSRQGVDVN TMTEAKSPNL LSTLSGKIAG LQIVPPGFNT GSARVVIRGN SSLTGNNQPL
FVVDGMPIDN TAGDGNIDYG NNAADINTED IENIEVLKGP NASALYGSRA ANGVILITTK
KGTTKFKVSL NSSLMMQRLT EFPEYQNAYG VGTSFYIDNT HRLPEAMVNY RSWGSPMMGQ
PYVALNGEIK PYLPQPDNIR DFYQSASLLT NNIAVEGGNT SSIYRISYTN YAGSSVVDGF
NLSNKQTGDV RLQNTFSKKV SLDSKISYVR DAVDNRQYSN ANGRNPTNLY THMGRSTDLA
ELMPYKDPLT GMEIGTHRNF SNPFWVINEN PNRDVKDRII ASLNPKVNFT NWLVFNGRLG
ADVLWWDGFE FNNIGSIVAS NPDGFMRTFN TKQQNFNLEG TLVANKTFNK FSVSTMLGAS
SFSSWFERRE QRINSLLQPG LINLSNAKEF PTVTQQQRDK RLNSVFGSVS LGYRGYAFVD
VTGRNDWSST LPRANNSYFY PSVGGSLIVN EILGLKSDIL SFAKLRASYA IVGNDTDPYR
LDQTYSFNGF LNGATLASLA TTMNNADLKP ERTTSFELGM DVRLFKNRVS IDGTYYNAAT
TNQIVTAQLP SSSGYLKRIY NAGKIKNWGY ELSGNAKVIA GKNFSWTTQL NYAANNSKVV
ELIEGIDRFQ LNNNSSYLYV YAEVGKPYAY LRGLGVARDA QGRMLIEDGG SLLVKDNDMA
FGTASPDWIG GIYNTFKFKN LDLGFLVDVK MGGVMYSGSI SRMLTNGVLA ETLYGRDDYY
KHTVIFGENN TELSGGAIWD AYFADGTKNT KFVTPQNYEY ARPNYAEFVI YDASYVKLRE
VTVGYTLPVK LLSKMPVKTA RFSLSGRNLA ILYRRTPRGL DPEAMSTSGN GQGIENGALP
PNAIYGLNIR LTF
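
The protein sequence can be cross-checked against the gene sequence by translating the DNA codon by codon. The sketch below is a minimal, standard-library-only illustration. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two tables differ in permitted start codons), and it does not handle ambiguous bases or alternative start codons. The names `CODON_TABLE` and `translate` are illustrative.

```python
# Codon table built from the usual compact ordering (bases T, C, A, G).
# Assumption: amino acid assignments for table 11 match the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    first + second + third: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, first in enumerate(BASES)
    for j, second in enumerate(BASES)
    for k, third in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    sequence = "".join(dna.split()).upper()
    protein = []
    for position in range(0, len(sequence) - 2, 3):
        amino_acid = CODON_TABLE[sequence[position:position + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate("ATGAAAGCAA GAGCGTACTT ATTACTGGTA"))  # MKARAYLLLV
```

The translated prefix matches the first ten residues of the protein sequence, and the last twelve bases of the gene (`CTTACTTTTTAA`) translate to the final residues `LTF` followed by a stop codon.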