Gene Phep_1124 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_1124 
Symbol 
ID8252218 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp1311514 
End bp1314375 
Gene Length2862 bp 
Protein Length953 aa 
Translation table11 
GC content40% 
IMG OID644934775 
ProductTonB-dependent receptor 
Protein accessionYP_003091404 
Protein GI255531032 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.223448 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.00360467 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACTAAAA TAATTACTCA ATTTACCAGA CGAGTCTTTA CAGGAAGCGT GTTTCTGCTG 
ATGATTGGTT ATAATGCCAA CGCACAGGAG CCGAATTCAC AAGACTCAAC CAAAAATTTA
TCACTTGGTA ACCAGGACCA GGAGATTAAA GTTCTTTACG GATCTAAGTC GAAGAGGTAT
ATTACAGGTG CCGTCTCTTC TATTTCAGGA GATGTGTTAA GTAATATTCC CGGTACTAAC
CGGCTAAGTG CGTTGTCAGG TCGTTTACCG GGTCTTTCAA TAATCCAGGG TAAAGCTACT
CCTGGAGCTG AGAGTATTTC GTACCAGATT CGGGGCTTTC ATAATCTAAA TGGTTCGGAT
TCACCCTGGA TAATTGTTAA CGGCAGGGTG GATGATGCAA GCCAGATTGA ACCTAACGAT
ATTGAAAGTA TCACCATACT GAAGGATGCA GCGGCTACAG CTTTATACGG CATGAACGCT
TCACAAGGCA TTATACTTGT AACCACTAAG CGGGGTAAAG TAGGAAAATT AAAGATTAAT
TATAATGTTG AATCGACCTT TCAGCAACCA ACCAGAATGC CTAAATTCCT CGACGCTTAT
AATTATGCAG TGCTTTATAA TGAAGCACAA CTAAATGATA ATCCGATTGC TGTGCCGAAA
TACAATGCTA CAGCACTACA GGCTTACAAA GATGGAAGCA ATCCTTACTT GTATCCGAAT
GTGGATTGGT CTGATGAACT GGTGAAGGAC CATTCATTGC AAATTCGAAA TAATGTAAAT
GTAAGTGGTG GTTCGGATAA TGCAAAGTAT TATTTTTCAG CAAGCTATCT CACGGATGAT
GGTATCTTTA ATACGGATAA AAGTATAAAT ACATACAGCA CTAACTCTAA TCTTGATCTT
ATTAATGTAA GGGCTAGTGT TGATCTGAAA ATAACCAAAA ATCTGAAAAT ATTTGCAGAC
CTCCGTTCAA AAAGGGACAA GCGTAATGCC CCGGGAGCCT ACAATACACT TTTCGACGAA
AACCTTTTTA ATTCAATATA CGCAACCCCG TCTAATGCTT ATCCGATGAA AAATGCTGAT
GGTTCATTGG GTGGAAATAT TACTTATCAG AATAACCCTT ATGGAGGGCT TAATTACGCT
GGCTATAATA ACATTGTCGG AGCTTCGATG TCATCATTCG CTGAATTGGA CTATGATTTT
AACAGCTTAT TGAAGGGATT GACGCTCAAA GCCAGACTCG GTTTTTCCAG TATATCATCA
TTTTATACAA ACCGTGCCAA GAATTTTGCA GTATATTCAC TAAATCCTGC CGGTTCCGCC
ACGCCTTATA CTCAACTTGG AGCAACTACT GTAATCGTAC CAGGTGGCGC ATATATTAAT
CGGAACCGGA TCTATGACCA TTCATTAGCA GCGAATTATC ACAGGGTTTT TGGCAATCAT
AATATCAGTT CAATGCTGAT GTATGAACGG CAGCAATTTG ATACTAATTC TGGTGATGGC
AATGTTGCAA ACTCGGGAAG ACAAACTAAA AATTACCAGG GGCCAAAGGG AAGCGTATCA
TACCGGTTTA AAGATAGGTA CCTGTTGGAT ATTGCAGCTT CTTATATGGG AAATGAGCAA
TATCCTGAAG GTGACCGCTA TGGCTTTTTT CCTGCGGTAT CTGCAGGCTG GATTGTTTCA
GATGAATCTT TCATGAAAGG AAGTTTAATT GATTTCTTAA AGATCAGAGG CTCATACGGA
AGAACCGGAA ACGTTGCGCC AGGAGACTTG AATTTTAGCT ATTACGGTTC TTATGCTGTT
GCGGCAGGAT ATGGTGCTTA TTTCGGTACT ACTCCCACAG CAAGTTCTGG TGTATATCAA
AGCCAGATAG CCAATCCTCT GGTCACCTGG GAAAAGACAC TGAAATCTAA TGCAGGATTG
GATATAGCCC TGTTAAATAA TAAGTTTAAT GCTTCTTTTG ATTATTTCAA TGAAGAAACT
AAAGATATTC TTATTTCCGG TGATATTACA GTGATGTACG GAGGCGGAAG TACTACTTCG
GTTCCTTCCG GCATTTTCAA GAACAAGGGG TTTGAGGTTC AGGCAGGATG GACAGACAAG
ATCAAAGATT TTCAGTATTC TGTAACAGCC AACTATTCTT TTGCAAAAAA TAAAGTTATA
GAAAACGGTG AGGTAGAAAA GCAGTATTCT TGGATGCAAA CGATAGGAAA TCCTTTACAA
ACCCGAATGG GATATGTTTT TGACCGGTTC TATACCGAAA CCGACAACAT TGCAAGTTTA
CCCAGCCAGT CATCACTGGG CACACAGAAA CCGGGTGATC TGAAGTATAA GGATCTGAAT
GGTGATGGAA TAATTAATGA GAACGATATT ACCGTTATTG GCAAGGCCAG AATACCACAA
AGTAACTTCG GATTGAACCT TGGAGCTCAG TACAAAGGGT TTGATTTGAA TGTGTTCTTT
CATGGAACGC AAGGAGGTAC AACCTATAAT TCAGGCCGTA CTTATTATGC ATTTGTTGGA
CAAACGGGAA ATGCGTTGGA GCATCACCTT GGCAGATGGA CTCCTGGGTC GGGGCAATCA
GCGACGTACC CTCGATTAAG TCTGACTAAT ACGAATAATA CCGCAGTTAG TTCATACTGG
GTAAAAGATA ACTCTTTTGT AAGGCTTAAA TACGCAGAAT TGGGGTATAC ACTCCCAGCC
AGGCTTGTAG GAAAGATCGG AATAACCGGC ACACGGATTT TTGTAAATGG CAATAATTTA
TTCCTATGGG ATGAGGTCAA ACTAAAAGAT CCGGAAATGG AAAATCCGGT AGGCTATCCA
TTGCTGCGCT CTTTTAGTGT AGGCCTTAAT GTGAAATTTT AA
 
Protein sequence
MTKIITQFTR RVFTGSVFLL MIGYNANAQE PNSQDSTKNL SLGNQDQEIK VLYGSKSKRY 
ITGAVSSISG DVLSNIPGTN RLSALSGRLP GLSIIQGKAT PGAESISYQI RGFHNLNGSD
SPWIIVNGRV DDASQIEPND IESITILKDA AATALYGMNA SQGIILVTTK RGKVGKLKIN
YNVESTFQQP TRMPKFLDAY NYAVLYNEAQ LNDNPIAVPK YNATALQAYK DGSNPYLYPN
VDWSDELVKD HSLQIRNNVN VSGGSDNAKY YFSASYLTDD GIFNTDKSIN TYSTNSNLDL
INVRASVDLK ITKNLKIFAD LRSKRDKRNA PGAYNTLFDE NLFNSIYATP SNAYPMKNAD
GSLGGNITYQ NNPYGGLNYA GYNNIVGASM SSFAELDYDF NSLLKGLTLK ARLGFSSISS
FYTNRAKNFA VYSLNPAGSA TPYTQLGATT VIVPGGAYIN RNRIYDHSLA ANYHRVFGNH
NISSMLMYER QQFDTNSGDG NVANSGRQTK NYQGPKGSVS YRFKDRYLLD IAASYMGNEQ
YPEGDRYGFF PAVSAGWIVS DESFMKGSLI DFLKIRGSYG RTGNVAPGDL NFSYYGSYAV
AAGYGAYFGT TPTASSGVYQ SQIANPLVTW EKTLKSNAGL DIALLNNKFN ASFDYFNEET
KDILISGDIT VMYGGGSTTS VPSGIFKNKG FEVQAGWTDK IKDFQYSVTA NYSFAKNKVI
ENGEVEKQYS WMQTIGNPLQ TRMGYVFDRF YTETDNIASL PSQSSLGTQK PGDLKYKDLN
GDGIINENDI TVIGKARIPQ SNFGLNLGAQ YKGFDLNVFF HGTQGGTTYN SGRTYYAFVG
QTGNALEHHL GRWTPGSGQS ATYPRLSLTN TNNTAVSSYW VKDNSFVRLK YAELGYTLPA
RLVGKIGITG TRIFVNGNNL FLWDEVKLKD PEMENPVGYP LLRSFSVGLN VKF