Gene Phep_1708 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_1708 
Symbol 
ID8252810 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp2026269 
End bp2029397 
Gene Length3129 bp 
Protein Length1042 aa 
Translation table11 
GC content42% 
IMG OID644935360 
ProductTonB-dependent receptor plug 
Protein accessionYP_003091981 
Protein GI255531609 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG4206] Outer membrane cobalamin receptor protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.0323206 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTAATC AAACTTCTAC TCTTCGTTGG TGTATTGTAG TTTTTTTATT GTGTTGTTGT 
GTGGTGGATT CATATTCGAG GGGTAACACA ATCAGTAATG AAAATTATTC TACTAAAAAA
GTAACCGTAT CAGGAAAAGT TTCTGATGCT GAAGGCCCAT TGCCAAGTGT GATCGTAAAG
TTAAAAGGAA CAAACACCTC TACTTCAACC AACGCAAATG GAAATTACAG TTTAACAATA
CCCGATGGTA CAGGTACCTT GGTTTTCTCT TCAATTGGGT ATGTTACCCA GGAAGTTGCG
GTTCAAAACA GAACTACAAT AAATGTAACG CTTGTCGCAG AAAATAAAAC TTTATCAGAG
GTGATCGTTG TTGGTTATGG AACACAAAAG AAATCTGATA TTACAGGTTC TATAGTCTCC
ATCAGCGAAC AGGCACTTAA AGATGTGCCT GTTGCAAATC TGTCTCAGGC CCTTCAGGGA
AGAGGTGCCG GTATAGACAT TCAAAAAAGC GGTGGTAACA GTAAGCCTGG TGCTTCGCCT
GTCATCAGGA TTAGGGGTGC CAGATCACTG GGAGCAACTA ATGATCCGCT ATTTGTTGTT
GATGGTATTC CTTATAATGG GAATATCAAT GACCTGAACC CTGATGATGT GGTATCTGTC
GAAGTGCTTA AAGATGCTTC CTCTACAGCT ATTTATGGGT CCAGAGGTGC AAATGGTGTT
ATCCTGGTGA CTACCAAGCG TGGGAAAATT GGCGAAGCTG TAGTTACTTA TAGCGGTTAT
GCCGGAGTGA CCAAAAACCT GGGTAAAATT GATGTGATGG ATGGTAAACA ATTTGAGATG
TTAAAGAAAT GGGCTGTAGT AAATGGAAAT TTTGTAAGCG GGGCCCCAAA ATACACAGGT
GTAGATGACC CCCGGATTAT GACTGATGGT ATTTTTGCCC CTCAGGAGCT TGAGTCTATC
AAAATGGGCA GAAGTACCGA CTGGCAGGAC CTGATCTATA AAAATGGCAT CACTACAAAC
CATCAGATCG GTGTGTCAGG TGGATCAGAA AAAACGCAGT ATGCCCTGTC CGGAGGCTAT
CATAACGAAA CCGGAATTTA TCCGGGCCAG TCTTTTGAGC GTTTTACAGC TAAAATAAGC
ATAGAGCAAC AATTGGGTAA ATATGTTCGG GTAGGGTTGA ACAGCATAAA CAATTTCAGC
TACACCAAAG GTGAGGGTGC CAACCCCATG GGGCAAGTGC TTCGAGCCAG TCCGCTTGCT
ACTCCTTATG ATGAAACTGG TAAATTATGG GGCTTTGTGC CCGGCAGTGC AAACCAGGTT
TGGAATCCAT TAGGTGATTT TGTAGAAGGC GCTAAAATAG AAAACAGAAA ACGTTTTGGA
ACATTTACCA CCTTATACCT GGAAGCTACT TTGGCCCCTG GTTTAAAATA TCGCTTTAAC
GGAGGTGCAG AAATCAAATC AGATGTTTAC GGAAATTTTT ATGCAAGCGC AACTTCAAAT
AATCTGGGTG GTTTATCCAC TTCCAGCAAC CGTACTGGTT TCAGAACGGA TTATACACTG
GAAAATCTAC TTACCTACGA TAAGGTAATT GCTGATCATC ATAAAATAAA CTTCACAGGA
TTGTTTTCTT TACAGGAAGC TCAGAGTCAG TCAAATTCCT TTAACAACAA CAACCTGATT
GCAGATAACG TATGGTACTA CAACCCTCAG CTGGGCTCCA ATCTGGTAGG ATCAGGAGAT
TATAGTAAAT GGTCACTGAT ATCCTATATG GGCAGGCTTA ATTACGGTTT CAAGGATAAG
TATCTGTTAA CGCTTACCAT GCGTTCTGAT GGTTCTTCCC GTCTGGCACC AGGAGGTAAA
TATAAAGTAT TCCCTTCTGC GGCCGTAGCC TGGAACCTGA TCCAGGAAGA TTTTATTAAA
AGTGTAGGCT CAATTTCAAA TTTAAAACTC AGGGGCTCAT ACGGAATGGT AGGTAATACC
TCAATAGATG CCTATGCAAC ATTAGGTGCG CTTACAGGGG TCAATTATAA TTTTGGTGAC
AAAACAACTA CCGGTTTATA TCTTTCCAAT GTACCAAATC CGGCATTGAC CTGGGAAAAC
AGTACTACGG CAAACGTTGC TGTTGACTTT GGTTTTTTAA AGAACAGAAT TACCGGTTCA
ATAGAAGCTT ACCACGTTTA TACTGATAAA TTATTGCTGC CACAAAACCT TCCTTTCACC
TCCGGAATTC CAAATGCAGT TTTAACCAAT GTTGGTAAAT CTGAAAACAG GGGTCTGGAA
TTTCAGGTGA GTACCGTAAA CATTGATGGT GATGGTAAGA AGAAATTCAG CTGGAGTACC
GACATCAATG TTTCCATCAA CAGAGGTAAA ATTACACAGT TACAGGAAGG CGTAATTAAC
GACATTACGA ATAACAGGTT TGTAGGTCAG CCTATCGGAA CCATTTATGA TTATAACAGG
GTGGGCATAT GGCAAAATAC GCCTGCCGAT ACAGCCGAAG CTAAAAGATT AGGGCTTACA
GTAACTACGG GTACGGGTTC TGTAATCGGG AACATCAGGC TGGCGGATAC AAACGGGGAT
GGTAAGATCA CAGCAGATGA CCGGATATTT ATAGGATCGA GCCAGCCCAA ATGGTCTGGA
GGCATGACCC ACCGTTTTGC CTATAAAAAT CTTGATTTTA CTGTTGTAAC CTTTGGAAGG
TTTGGAAGTA CCATTATCAG TAGCGTGCAC AATAGCGGAT TTGCCAATAC CTTTCAGGGA
AACTATAACA ATCTTGATGT AAATTACTGG ACACCAACAA ACCATGAGAA CTATTGGCCT
AAGCCAAATG CCGCATCAAC CAATACTCCT AACAATTCTA CCTTGGGTTA TTTTGATGGT
ACATTTGTGA AAATCAGGAG CCTGGCATTG GGCTATAATT TACCTGCACC GCTCGCCGCT
AAGATAGGGG GAAGGTCATT GAGGGTATAC GCTTCTGTAA ACGACGCATT TATTCTTTTC
TCCAAGTACA GAAATATATA TAAGGGTATA GATCCCGAAG CGATCAGCGG AAGTAACAAC
AGAAGTTCTG TAGGTGTAGA TACACCAGCA AGCTATTCTA TGACTTTTGG TTTAAATATG
AGTTTATAA
 
Protein sequence
MINQTSTLRW CIVVFLLCCC VVDSYSRGNT ISNENYSTKK VTVSGKVSDA EGPLPSVIVK 
LKGTNTSTST NANGNYSLTI PDGTGTLVFS SIGYVTQEVA VQNRTTINVT LVAENKTLSE
VIVVGYGTQK KSDITGSIVS ISEQALKDVP VANLSQALQG RGAGIDIQKS GGNSKPGASP
VIRIRGARSL GATNDPLFVV DGIPYNGNIN DLNPDDVVSV EVLKDASSTA IYGSRGANGV
ILVTTKRGKI GEAVVTYSGY AGVTKNLGKI DVMDGKQFEM LKKWAVVNGN FVSGAPKYTG
VDDPRIMTDG IFAPQELESI KMGRSTDWQD LIYKNGITTN HQIGVSGGSE KTQYALSGGY
HNETGIYPGQ SFERFTAKIS IEQQLGKYVR VGLNSINNFS YTKGEGANPM GQVLRASPLA
TPYDETGKLW GFVPGSANQV WNPLGDFVEG AKIENRKRFG TFTTLYLEAT LAPGLKYRFN
GGAEIKSDVY GNFYASATSN NLGGLSTSSN RTGFRTDYTL ENLLTYDKVI ADHHKINFTG
LFSLQEAQSQ SNSFNNNNLI ADNVWYYNPQ LGSNLVGSGD YSKWSLISYM GRLNYGFKDK
YLLTLTMRSD GSSRLAPGGK YKVFPSAAVA WNLIQEDFIK SVGSISNLKL RGSYGMVGNT
SIDAYATLGA LTGVNYNFGD KTTTGLYLSN VPNPALTWEN STTANVAVDF GFLKNRITGS
IEAYHVYTDK LLLPQNLPFT SGIPNAVLTN VGKSENRGLE FQVSTVNIDG DGKKKFSWST
DINVSINRGK ITQLQEGVIN DITNNRFVGQ PIGTIYDYNR VGIWQNTPAD TAEAKRLGLT
VTTGTGSVIG NIRLADTNGD GKITADDRIF IGSSQPKWSG GMTHRFAYKN LDFTVVTFGR
FGSTIISSVH NSGFANTFQG NYNNLDVNYW TPTNHENYWP KPNAASTNTP NNSTLGYFDG
TFVKIRSLAL GYNLPAPLAA KIGGRSLRVY ASVNDAFILF SKYRNIYKGI DPEAISGSNN
RSSVGVDTPA SYSMTFGLNM SL