Gene Phep_2244 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_2244 
Symbol 
ID8253350 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp2602590 
End bp2605919 
Gene Length3330 bp 
Protein Length1109 aa 
Translation table11 
GC content51% 
IMG OID644935893 
ProductTonB-dependent receptor plug 
Protein accessionYP_003092510 
Protein GI255532138 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAATTCG TGCTTATTTT GCTATTCCCG CTATTTGCGT TACTGAATAC GGTTGATGCG 
CAGAAAATAA CCCTGTCGGA AAAAGACGTC CCCCTTGAAA AAGTATTTAA AGAGATACGA
CGCCAAAGCG GCTATAATTT CATTTACAAT TCGCAAATGC TGAAGGAAGC CAATCCCGTA
AGTATTGTGG TAAAGAATGC GGGGTTAAAA TCAGTGCTCG ACCTGTGTTT TGCCGGACAG
CCCATTACTT ATCTGATCAA CCGCAATACA GTGGTGGTGA AATGGAGACA GCAGCCTGCT
GCTCCTGCAA ACCAGTTGCA GCTTATAAAA GGTCTGGTAA GGGACGAACA AAAAGCAGCT
TTACCTGGTG TGAGTGTTTA TTTAAAAGAT GAAAAAAAGG GGACTGTTAC AGATGCTGAT
GGCCGTTACA TGCTGGAGGT TCCGGATGAT ACAGGGATAC TGGTGTTCAG TTATATGGGT
TTTCAGTCGA AAGAAATCGC CGTAAGTTCG GGCGGCTATG CCATTGTTGA CCTTCAGGAA
GATACCAAAG GGCTGGCAGA GCTGGTAGTA GTGGGCTATG GTACCCAAAA GAAGCTGACC
GTTACGGGCT CGGTAAGCTC GGTTAAAGGG TCGGAACTCA GACAAAACCC TTCGGCCAGT
TTGCAGAACA CCCTGAGCGG CCGCCTGCCC GGTTTCTTTT CCCAGCAGCG ATCGGGCATC
CCCGGCAGTG ATGGGGCTGC CTTTTACATC AGGGGGGTAA GCACCTTTTC CAATGGGGCA
GGTGCCAACC AGCCACTGAT CATTGTGGAC GATGTGGAAT CTACCTACGA CCAGGTGGCC
AGGATAGATG CCAACGAGAT CGAAAGCATC TCTATCCTGA AGGATGCCTC TACAACCGCC
GTTTACGGCA TCAAAGGGGC TAACGGGGTA ATGGTGATCA CCACCAGAAG GGGACAGGCC
GGGCCGGCTA AGATCAGCCT GCGTACCGAG ACCGGTTTTC AGCAGCCCAC AAAAGTGCCT
GAATACCTCA ATTCCTACCG TACTGCACTG CTCAGGAACG AGGCCCTGGC CAATGATGGC
CTGGCAGCAG AGTTCTCGGC TGCCGACCTG GAGCATTTCC GCCTCGGTGA TGATCCTTAC
GGACATCCGG ACATCAACTG GTATGAGACC CTGTTTAAGG ATTTCAGCAC CCAGTGGCGC
AACAATCTCG ACATTTCGGG GGGAACCGAG AACACCAGAT ACTTTGTTTC CCTGGGCAGC
TTATGGCAGA ACGGAATGCT GCGCAACTTT GGCGAAGCCT CAGATGTGAA CAATGATTAT
TCTTATAAAC GCTATAACTT CAGGAGCAAC CTGGATGTGA ACCTCACCAA AACCTTAAGC
CTGCGCTTCG ACCTCTCGGG CAATATCGGC AGGACCAATA CCCCCAATGT ACCGGGGCCC
TTCAGCCGTA ACGATGTGTT CTTTGAGGTC AGCAATTACC AGTTCCTGCC GCCCTATGTA
TATCCCATCT ACAACCCCGA TGGCAGCTAT GGCTTCAGCA ACAGGGTGCG CGACCGCATC
AACAATGTGG TGGGCAGGCT TTCCCTGGAT GGCTACCAGC GCAACTTTGA GAACAACCTG
AACTTTGTGG CCAATGCGGT GCAAAAACTG GATGTGCTTA CCAAAGGACT GTCGGTAAAG
GCCAACCTGT CCTATGCCAG CAGCCAGAGC TCATCCCGTA ACCTGACCCG TGACCTGTTC
CCTTCCTTTA TTTACAACCC GGCCGATGAC AGTTATACCC CAAGGGACGA GAGTGTGTTC
CGCGTACAGA AGTACCGGCT GCAGTACGGG ACTGGCAATA TGCTCAGAAG GCTCAACACG
CAGCTGATGC TCAATTACGA CAGGAGCTTT GATCAGCACC ATTTATACGG ACTGGCCCTG
TTTAACCAGA TGACAGACAT TGCACCCAAT ACCGATACCC AGTACGATTA TGTGCCCTCC
AACTCCAGGG GCTTTACCGC AAGGCTGGGC TATGATTACA AAAGCAGGTA CCTTTTGGAG
TTCAACATGG CCTATAATGG TACCGACCGG TTTGTGGGCA ACAAGCGCTA TGGCCTGTTC
CCGGCCGTAT CAGCAGGATG GAACGTGGCC TCGGAACCTT TTATGAAGGG ACTGAAAGCT
GTACAGCTGC TGAAGCTGCG CGGTAGTTAT GGCATGGTGG GCAGTGATGT GGTATCGGGT
GGCAAGTACC TGTACAAACA GAGTTACGAC AGGGGCGGAA CCACCTCTTT TGGCTATTCG
CACAATGCCT ACAGTGGTAT TGTGGAAGGT ACCCTGGGCA ATGCCGATGT GAGCTGGGAA
AAGGAGCGAA AGGCCAATAT CGGTATAGAC CTGCTGATGT GGGGTGGCAA GCTGGGGGCC
ACCATCGATT ATTTTGACAA TTACCGTTAC GACATTTTAA CGCCAAGGAA CGGGGTATCC
AGCATTTTCG GGCAGGCTTT GCCGGTAATG AACGTGGGAG AGGTGAGCAA CAAGGGCTAT
GAAGTGGAAC TGACCCATAA CAACAGGATC AATGACAAGC TGAACTATAC GATAAGGGGC
AACATCTCGG TGGCCAGAAA CAAGATCCTG TACCAGGATG AGGCCGAACC GGCCTTTCCA
TGGCTGCGAC AGACCGGCCA CAGCATCGGT TCCATTGCTG TATACACCTT TAACGGTTTT
TACAGGGATG CTGCAGATGT GCAGGCCAAT CCCGCACCAA ACGGCATTGT ACCCAAACCG
GGCGACATGA AATATAAAGA CCTGAACGGG GACGGGCTGA TAGACAGCTA CGACAAAAGC
TATGTGGGCT ATCCGAACCT GCCCGATACC AATTACGGAC TGAGCCTGGG GCTGAATTAC
GGGGCTTTCA GCCTGAATGT ATTGTTCCAG GGAGCGGCAA ACTTCAACAT CCGCGGAACG
GCAGCAGCCA TTGATGCCTT CCAGTCCAAC CTGCAGCCTT TGCATGAAAA GAGGTGGACA
CCGGAAACTG CTGAAACTGC AGCTTACCCG AGGTTGTCCT CCATCATTGG CGGCCTGAAC
AGCTCTACCG ATTATCCTTC TACCTACTGG CTGGTACCGG GTGATTACCT CAGGTTAAGA
TCGGCCGAAC TCAGTTACAG TTTGCCCCAG GGCATGGTCA AAAGGCTCAG GATGCAGTCG
GCCAGGATCT ATACCAATGG CTACAATCTG ATCACCTGGT CGAAGATCGA CAAACGCTAT
CAGCTAGACC CTGAAGCCTC TTCCGGTGGA GACAAATACC CTTATCCGCC GCAAAGAATA
TACAATATTG GATTAATGCT CTCCTTTTAA
 
Protein sequence
MKFVLILLFP LFALLNTVDA QKITLSEKDV PLEKVFKEIR RQSGYNFIYN SQMLKEANPV 
SIVVKNAGLK SVLDLCFAGQ PITYLINRNT VVVKWRQQPA APANQLQLIK GLVRDEQKAA
LPGVSVYLKD EKKGTVTDAD GRYMLEVPDD TGILVFSYMG FQSKEIAVSS GGYAIVDLQE
DTKGLAELVV VGYGTQKKLT VTGSVSSVKG SELRQNPSAS LQNTLSGRLP GFFSQQRSGI
PGSDGAAFYI RGVSTFSNGA GANQPLIIVD DVESTYDQVA RIDANEIESI SILKDASTTA
VYGIKGANGV MVITTRRGQA GPAKISLRTE TGFQQPTKVP EYLNSYRTAL LRNEALANDG
LAAEFSAADL EHFRLGDDPY GHPDINWYET LFKDFSTQWR NNLDISGGTE NTRYFVSLGS
LWQNGMLRNF GEASDVNNDY SYKRYNFRSN LDVNLTKTLS LRFDLSGNIG RTNTPNVPGP
FSRNDVFFEV SNYQFLPPYV YPIYNPDGSY GFSNRVRDRI NNVVGRLSLD GYQRNFENNL
NFVANAVQKL DVLTKGLSVK ANLSYASSQS SSRNLTRDLF PSFIYNPADD SYTPRDESVF
RVQKYRLQYG TGNMLRRLNT QLMLNYDRSF DQHHLYGLAL FNQMTDIAPN TDTQYDYVPS
NSRGFTARLG YDYKSRYLLE FNMAYNGTDR FVGNKRYGLF PAVSAGWNVA SEPFMKGLKA
VQLLKLRGSY GMVGSDVVSG GKYLYKQSYD RGGTTSFGYS HNAYSGIVEG TLGNADVSWE
KERKANIGID LLMWGGKLGA TIDYFDNYRY DILTPRNGVS SIFGQALPVM NVGEVSNKGY
EVELTHNNRI NDKLNYTIRG NISVARNKIL YQDEAEPAFP WLRQTGHSIG SIAVYTFNGF
YRDAADVQAN PAPNGIVPKP GDMKYKDLNG DGLIDSYDKS YVGYPNLPDT NYGLSLGLNY
GAFSLNVLFQ GAANFNIRGT AAAIDAFQSN LQPLHEKRWT PETAETAAYP RLSSIIGGLN
SSTDYPSTYW LVPGDYLRLR SAELSYSLPQ GMVKRLRMQS ARIYTNGYNL ITWSKIDKRY
QLDPEASSGG DKYPYPPQRI YNIGLMLSF