Gene Phep_4106 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_4106 
Symbol 
ID8255240 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp4954698 
End bp4957844 
Gene Length3147 bp 
Protein Length1048 aa 
Translation table11 
GC content41% 
IMG OID644937770 
ProductTonB-dependent receptor plug 
Protein accessionYP_003094359 
Protein GI255533987 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG4206] Outer membrane cobalamin receptor protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAAAAC TCAAACTACT AAATTGTAAC TTTTTAAGTC TGAGGGTTAC AAAATTGTTG 
CTTGCTTCTG TTTTTATAGT CTTACTGGCC GCCTCCGGTA AAGCTGTAGC AGCAAAGAAA
ATGTTGGCCC CCATAACCAT CACAGGTAAG GTGATGGATG AAAAAGGCGA GACCATTATT
GGGGCAACCA TCAAAAGTTC GGCAGGTGGT GGTGCCGTTA CCGATGTAAA CGGAGGTTAT
TCTTTAACTA CAGAGAGTAA CGCGACGCTT ACTGTTAGTT ACCTGGGATA TATTAGCCAG
GAAGTAAAGG TAAATAACCG CACAAAAATT AACATTACCC TTGTCCCTGC GGCCAATCAG
TTAAATGATG TAGTTGTAAT CGGCTACGGT ACGCAAAAGC GTAAAGATGT CACCGGCTCA
ATTACTACCG TCAGATTAGA AGATGGTCCT AAAGCCAGTG TCCCGTTTGT AAATGCGTTA
GAAGCATTAC AAGGTACATC TGGTATCAAT GTTGGCCCTT CAAATGCTGC AGGTGCTACC
CCTAACATTG CAATCAGGGG GCAGAATTCT ATCAGTGGGG GAGGAAATCC ATTAATCATT
TTAGACGGGG TGATCTTTGA TGGAAACCTT AATGAGATCA ATATGAACGA TATTGCTTCT
TATGATATCC TGAAAGATGG TAGTGCTGCT TCCATTTATG GTTCACGTTC TGCAAATGGG
GTACTTGTAA TTACTACCAA GCGTGGCAGA ACAGAAAAAC CTCAGATCAA CTTCAGCACC
TATTATGGCG TACAAAACTG GACAAGGGTT CCTAAAATGA AAGCCGGTGA TGAGTTTATT
CAGTGGAGAA AAGACAATAT GTCTATCAGG GGTCAGGACA TTACAGATAT GACGAAAGTG
TTGTCGAATC TAGAGTATAA GGCGTATAAT GAGGGGCATA CCTTAAATTG GCTGGATGAA
GTTACACAAT TTGCGCCTAT CCAGAATTAC CAATTAAGTG TCTCCGGAAA AACAGATAAC
ACCAATTATT ATTTCTCAGC GGGTTACCTG AACCAAAAAG GGGTGTTGTA CAACGATAAG
TTTAAGAAGC CAAACCTCAC CCTAAAAGTA GAGAATAATA TTACAGACTG GTTATCGTTT
GGTGCAAACG GATATTATTC GTCACGTGAT TTTACAGGCA ATTCGCCAAG CTTATACATG
GCTACTTATA TATCCCCATA CAGTTACAGG TATGTAGATG GCAGCGATAT CTACCAGCGT
TACCCCACAG GAAATACGTC AATGTACAGT CCTTTCTGGG GTAGCCCTAC TAATGTTACT
CAACCTGGAG TCTATGATGA CGATTTGAAC AAGCAAAATA CGATAAGAGG TACCGGGTTT
ATCAATGCGA AAATACCTTT TATCAAAGGT TTGAACTATA GATTTGAAGC TACAGGGACA
AAATCGGTTT CGAATACAGG CTTTTTCCAC CACGAATTTG GTGAAGTGAA CACTTTGCTT
CCTGGAGACA TTGCGAATCC TGCACAGTTT TTGTCCAGAG CAAATGGGTA CAGGATAAAT
AATCAGGGAA ACAGCTGGGT AATCAATAGC CTGATTTCCT ATAACCGTTC ATTTGGGGAC
CATAATATTG ACGCTTTGTT TGGTTATACC CGCGATTTCA GTACAACTGA ACAGCTGAGG
GTAAATTCAT CCGATTTCTC TGCAGCCGGA ACCACCTTGT TAGGGATGAA CGGTTTAAAT
ACGGGTAAAG TCATTACAAC AAATACTGAA TTTGCTAAAA CAACCAATGT TGGTTATTTC
GGCAGGTTAA ATTACAACTA TAAATCAAAG TACTACGGTA CATTTACCTT GCGGCGTGAT
GGTTATTCTG CATTCGCCGA AGGCTTTAAG TACGGTTATT TTCCAGGGGG ATCTGTTGCA
TGGGCATTGA GTGAAGAAAA TTTCATGAAA GATGTGAAAT TTGTAAACTA CCTGAAAATA
AGGGCTTCTT ATGGAAAAAC AGGTAGCCAA TCGGTAGGGT CATACTCTTC ACTTGCTTTT
ACTGACAACA CCTTGTTTAC CGTTTTTGGC AGCAATTCTT TTTTAATCAG TACTCCTTCC
ACACTGGCTA ATAAAACATT TACCTGGGAA ACTACAAATA CCTTAAACCT TGGGGTAGAT
TTTCAGTTGT TAGACCAGCG CTTAACCGGT AATGTTGATG TTTACTCCAG CAAAACCGAT
AATCAACTTC TAACACGATT GCTGCCAATT TTCACTGGTT TTAGTTCCGT AAAATCGAAT
CTCGGCGAAG TTCAGAACAG GGGAATTGAA ATAACTTTAA ATTCAACCAA CATAAAATCT
GATGATGGAT TTAGCTGGAG CTCTGGAATC AGTTTCTGGC TGAACAGAAA TAAAGTGACG
CATTTGCCTG ATGATAAAGA CCAGCCGGAA AACTCATTGT TCATTGGTAA GTCTTTATTA
GGGTTTTATG ATTATACCGT TGAGGGAATC GTGCAAAGTT CTGATACGGA ATATATGAAC
AAATACAAAA CTGCAGGTGG AGCCCAGATA TTCTTTCCTG GAGACCTTAA AATCAAGGAC
ATCAATGGTG ATGGGATGAT AGACGTTAAC GACAGGTCCG TAATTGGATA CAGCAAGGAA
AACTTCAACT TCAATGTTTC CAATACCTTT AATTACAAAA ATTTCCAATT GTTTTTTACT
GTTAACGCTA TCATTGGTGG TGGAAAGAAC AATTTCTTTA TGTCGGGCAA TCCACGCGGG
TTAAATCCAG CGTCGCTTTT GCCAACTTCA GGAAATTGGA TTGCCGGCCA AAACCCATGG
ATGCCGGATC GTGAAAGCAA TGAATTTGTA AGACCTAACT ATGGAAATCC GTTTTCATAT
GGTTTTTACC AGTCGCGTAC ATTCGTGCGT TTGCAAACGG CCTCATTAAG CTACAGTCTT
CCGAAAGAAC TGCTCAACAA GTTAAAAGTA GATAATTTGA AGTTGTTCGT AAGTGGTACC
AACTTACTGA CTTTTACAGG ATGGACAGGT CTGGATCCAG CCAATGGGGC ACAGATTGGT
GGTAACGGCG GGTCTTCTCA GACATCGGTA AACGCCAATA CGCCTATTAT GAGAACTGTT
TCCTTTGGCT TAAACTTAGG GTTTTAA
 
Protein sequence
MTKLKLLNCN FLSLRVTKLL LASVFIVLLA ASGKAVAAKK MLAPITITGK VMDEKGETII 
GATIKSSAGG GAVTDVNGGY SLTTESNATL TVSYLGYISQ EVKVNNRTKI NITLVPAANQ
LNDVVVIGYG TQKRKDVTGS ITTVRLEDGP KASVPFVNAL EALQGTSGIN VGPSNAAGAT
PNIAIRGQNS ISGGGNPLII LDGVIFDGNL NEINMNDIAS YDILKDGSAA SIYGSRSANG
VLVITTKRGR TEKPQINFST YYGVQNWTRV PKMKAGDEFI QWRKDNMSIR GQDITDMTKV
LSNLEYKAYN EGHTLNWLDE VTQFAPIQNY QLSVSGKTDN TNYYFSAGYL NQKGVLYNDK
FKKPNLTLKV ENNITDWLSF GANGYYSSRD FTGNSPSLYM ATYISPYSYR YVDGSDIYQR
YPTGNTSMYS PFWGSPTNVT QPGVYDDDLN KQNTIRGTGF INAKIPFIKG LNYRFEATGT
KSVSNTGFFH HEFGEVNTLL PGDIANPAQF LSRANGYRIN NQGNSWVINS LISYNRSFGD
HNIDALFGYT RDFSTTEQLR VNSSDFSAAG TTLLGMNGLN TGKVITTNTE FAKTTNVGYF
GRLNYNYKSK YYGTFTLRRD GYSAFAEGFK YGYFPGGSVA WALSEENFMK DVKFVNYLKI
RASYGKTGSQ SVGSYSSLAF TDNTLFTVFG SNSFLISTPS TLANKTFTWE TTNTLNLGVD
FQLLDQRLTG NVDVYSSKTD NQLLTRLLPI FTGFSSVKSN LGEVQNRGIE ITLNSTNIKS
DDGFSWSSGI SFWLNRNKVT HLPDDKDQPE NSLFIGKSLL GFYDYTVEGI VQSSDTEYMN
KYKTAGGAQI FFPGDLKIKD INGDGMIDVN DRSVIGYSKE NFNFNVSNTF NYKNFQLFFT
VNAIIGGGKN NFFMSGNPRG LNPASLLPTS GNWIAGQNPW MPDRESNEFV RPNYGNPFSY
GFYQSRTFVR LQTASLSYSL PKELLNKLKV DNLKLFVSGT NLLTFTGWTG LDPANGAQIG
GNGGSSQTSV NANTPIMRTV SFGLNLGF