Gene Phep_2243 details


Gene Information

Locus tag: Phep_2243
Symbol:
ID: 8253349
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pedobacter heparinus DSM 2366
Kingdom: Bacteria
Replicon accession: NC_013061
Strand:
Start bp: 2599729
End bp: 2602470
Gene Length: 2742 bp
Protein Length: 913 aa
Translation table: 11
GC content: 48%
IMG OID: 644935892
Product: TonB-dependent receptor
Protein accession: YP_003092509
Protein GI: 255532137
COG category:
COG ID:
TIGRFAM ID:
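
The listed lengths follow from the coordinates by simple arithmetic. The sketch below is a minimal illustration, assuming the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the 2742 bp stated above) and that the final codon of the CDS is a stop codon. The function names are hypothetical and not part of any database API.

```python
def feature_length_bp(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def protein_length_aa(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length_bp // 3 - 1


gene_len = feature_length_bp(2599729, 2602470)
print(gene_len)                     # 2742
print(protein_length_aa(gene_len))  # 913
```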


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGTCAG CGCCAGCGGC CGGTCCGGTT ATCAGGAATG AGTTCCGGAT TTCAGCGACA 
CAGCTCAAAA CAGGATCAGC CAGCTGGCAG GACAGCCTGA AGCTCAAGCG GAAAAAGCCT
GCCCAGCTTT TATCAGGCAG TGTTTCCGAT ACCCTTAAAA TTGCGTCCAG TGATGCAATT
TACACAGCAG ACCTGCTGAA ATCCCCGACA GGGAGTTTAT ATAACATATT AACCGGCCGT
TTATCGGGCC TGTACACCAA CCAAAGCTCG GGCGAACCGG GATTTGATGG TGCAGCTTTG
CAGTTAAGGG GACAATCGCC TTTAATGATC ATTGATGGGA TACCACGCTC GGTAACCTTG
CTGGACCCGG AGGAAATTGA ATCGGTTACC GTACTTAAAG ATGCCCTGGC CACGGCCATG
CTGGGTGTAC GCGGATCGAA TGGGGCTTTG CTGATCACTA CCCGGAAGGG GTTTAACGGA
AAACGTAAAA TTGCCTTTAC GGCACAGACT GGTGTACAGC AAGCCTTGAA AATGCCCGAT
TTTTTAAATG CTTATGACTA TGCAAACCTG TACAATGAGG CGCTGAGGAA TGATGGGCTG
GCCGCAAAAT ACAGCAATGC CGATCTGGAA GCTTACCGGA ACGGGACAGA CCCGACGGGG
CATCCGGATG TAAACTGGAA AGATCAGGTA TTGCGCCCGG CAGCCCCTAT AAGCAGGTAC
AACCTGAATG TAAGTGGTGG CGGAGATGCC ACACGTTATT TCATCTCGCT CGAGAATTTT
AACCAGGGCG GTCTGATCAA AGAATCCGAT GCCAATAAAT ACAGTACCAA TTCGAGCATC
ACCAGATATA TTGTCCGCTC CAATATAGAG GTAGATCTGG ATCGGCAGAC CACACTGGGT
TTAAGGTTAT TCGGGCGCAT CATGAACGGA ACGGAGCCGG GTGGCACAGT AACCAATATT
TTCAGTTCGC TGATCAATAC ACCCAATAAT GCTTATCCGG TATTTAATCC GGACGGTTCT
TTGGGTGGGG TACAGCAGTT TACCAATAAT ATATACGGAC AGGCTACGGC TGCAGGTTAT
CAGGCCACTT ATAACCGCGA CATGATTGCC GATGTTTCGC TGACCCGTAA ACTGGATTCC
TGGGCCAAAG GGCTATGGGT ACGCGGTATG GCTTCTTATT TTGCCTCTGC CAGTCAGCGT
ACCATGCGCA ATAAAACCTT TGCCACCTAC CAGTATACCG GTACCGGCTA CAGGGTTTTT
GGGGTGAACG GCGACCAGTC GAATGCCAGT ACCGTTTCGC AGAACAACCG TCAGGTATAT
GCAGAATTTT CTGCCGGGTA CAGCAACAGG TTTGGTGCAA ATGGCATTGA TGTTTATCTG
GCTGCCAATA GTGATACCCG TTCGATAGAT GGCGACCTGG ACCTGAACTA TTCAGGGATT
TCCGGTAAGC TGACCTACGA TTTCAATAAA AAATACATTG CAGAACTGGC ATTCGGAATG
AACCGTTCTA ACCGTTATCC TAAAGGCACA CCACTTGGCC TGTTCCCGGC CCTTGGTTTG
GCCTGGAACA TTTACCAGGA AGATTTTATG AAAGATTCCT GGCTGAAAGA CCTGAAACTT
AGCGCTTCTT ACGGCAAAAC AGGCTGGGAC AAGGCCGGTT ATTACGTATT TAACCAATAT
TATACGGACG CATCAGGATT GGGTTACGTA TTTGGTGCTA CACCTGCAAC GGTAAATGGG
GTAACAGAAT CTACACTGGC CAATCCCGAC ATCCGATGGG AAAAATCAGA TAAGCTGAAC
ATAGGCTTGC AGGGTAGTGT GCTAAACCAG CGGTTAGCTT TTGATGTGCA GTATTTCAAC
AACAAATATT ATGACCTGCT GATGCAGCGG GGCAGGAACA GTGCGGTACT GGGCAACGAT
TATCCGGACG AAAACATCGG CATCAACCGT TATACAGGAG CAGATTTTCA GTTGAAATGG
CAGCAACAGA GCGGTAATTT CAGCTATTTT ATCGGAGGAA ATGCGAGTCT GGTCAAGTCC
AGGGTGATAT ACAGCGACGA GGTATTTCAG CAGTACAGCT GGATGCAGCG TACCGGGCAG
GCAGTGGGAC AGCCTTTTGG ATATATTGCC GAGGGCCTGT ACCAGACTGT TGATGCCGGC
AGTGTAGCGG TAACCGGCTA TACCCCACAG CCGGGTGACA TTAAATACAA GGACCTAAAC
AATGACGGTA TCATTGACCA ATACGATGAG GCTCCTATAG GTTCTACCAA ACCACTGTTT
TTTTATGGTG TTACATTGGG TTTCAACTGG AAGGGCCTTG ATTTTAGTGC ATTGCTGCAA
GGTGTAGAAA ACAGGAATAT GCTGCTTACC GGCAATAGTG AATGGGAATT TCAGAGCAAT
GGTTTCGGAC AGGCTTATCC TTTCCAGCTG GACCGCTGGA CACCACAAAC TGCAGGCAGC
GCCAGCTATC CGCGTTTAAC GGCAGGTAGC AATGTAAACA ACCATAAGAC TTCCTCTTAC
TGGATGCATT CAGGCGATTA CCTGCGCCTA AAAACGGTAG AACTGGGCTA CACGCTTCCA
ATGAGAATTT CCAGAAAGAT TAAACTGGAC AACATCAGGG TATTTGCCAA TGCACTTAAC
CTGTTTACCA TCGCTGATTT TGACCGGGTA GATCCGGAAG TGAAACCAGG TTCCTATCCG
ATACAGCGTG TAATTAATGG CGGTATTTCA ATTAAACTAT AA
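
The gene sequence above is displayed in blocks of 10 bases, 60 per line. A minimal sketch of how such a block could be collapsed into one contiguous string and checked against the record's length and GC content is given below. The helper names are hypothetical, and the stop-codon set is an assumption based on the standard stops used with translation table 11.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a displayed sequence block."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned DNA sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


def ends_with_stop(seq: str, stops=("TAA", "TAG", "TGA")) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop."""
    return len(seq) % 3 == 0 and seq[-3:] in stops


# First displayed line of the gene sequence: six blocks of ten bases.
first_line = "ATGGCGTCAG CGCCAGCGGC CGGTCCGGTT ATCAGGAATG AGTTCCGGAT TTCAGCGACA"
print(len(clean_sequence(first_line)))  # 60
```

Applied to the full sequence, `len(clean_sequence(...))` can be compared with the stated gene length and `gc_percent(...)` with the stated GC content.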
 
Protein sequence
MASAPAAGPV IRNEFRISAT QLKTGSASWQ DSLKLKRKKP AQLLSGSVSD TLKIASSDAI 
YTADLLKSPT GSLYNILTGR LSGLYTNQSS GEPGFDGAAL QLRGQSPLMI IDGIPRSVTL
LDPEEIESVT VLKDALATAM LGVRGSNGAL LITTRKGFNG KRKIAFTAQT GVQQALKMPD
FLNAYDYANL YNEALRNDGL AAKYSNADLE AYRNGTDPTG HPDVNWKDQV LRPAAPISRY
NLNVSGGGDA TRYFISLENF NQGGLIKESD ANKYSTNSSI TRYIVRSNIE VDLDRQTTLG
LRLFGRIMNG TEPGGTVTNI FSSLINTPNN AYPVFNPDGS LGGVQQFTNN IYGQATAAGY
QATYNRDMIA DVSLTRKLDS WAKGLWVRGM ASYFASASQR TMRNKTFATY QYTGTGYRVF
GVNGDQSNAS TVSQNNRQVY AEFSAGYSNR FGANGIDVYL AANSDTRSID GDLDLNYSGI
SGKLTYDFNK KYIAELAFGM NRSNRYPKGT PLGLFPALGL AWNIYQEDFM KDSWLKDLKL
SASYGKTGWD KAGYYVFNQY YTDASGLGYV FGATPATVNG VTESTLANPD IRWEKSDKLN
IGLQGSVLNQ RLAFDVQYFN NKYYDLLMQR GRNSAVLGND YPDENIGINR YTGADFQLKW
QQQSGNFSYF IGGNASLVKS RVIYSDEVFQ QYSWMQRTGQ AVGQPFGYIA EGLYQTVDAG
SVAVTGYTPQ PGDIKYKDLN NDGIIDQYDE APIGSTKPLF FYGVTLGFNW KGLDFSALLQ
GVENRNMLLT GNSEWEFQSN GFGQAYPFQL DRWTPQTAGS ASYPRLTAGS NVNNHKTSSY
WMHSGDYLRL KTVELGYTLP MRISRKIKLD NIRVFANALN LFTIADFDRV DPEVKPGSYP
IQRVINGGIS IKL
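
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The following is a minimal sketch of that translation in plain Python, not the pipeline that produced this record. It assumes the amino-acid assignments of translation table 11 are the same as those of the standard code (table 11 differs in which start codons it permits), and it does not handle alternative start codons, which would be read as methionine at the initiation position. `translate_cds` is a hypothetical helper name.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first base varies slowest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop display spaces and newlines
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for pos in range(0, len(seq), 3):
        amino_acid = CODON_TABLE.get(seq[pos:pos + 3], "X")  # X = unknown codon
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First seven codons and last four codons of the gene sequence above.
print(translate_cds("ATGGCGTCAG CGCCAGCGGC C"))  # MASAPAA
print(translate_cds("ATTAAACTAT AA"))            # IKL
```

The two printed fragments match the first seven and last three residues of the protein sequence listed above.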