Gene Phep_0754 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_0754 
Symbol 
ID8251843 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp874966 
End bp878082 
Gene Length3117 bp 
Protein Length1038 aa 
Translation table11 
GC content46% 
IMG OID644934404 
ProductTonB-dependent receptor plug 
Protein accessionYP_003091038 
Protein GI255530666 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG4206] Outer membrane cobalamin receptor protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGTGTA TTAGATTTAT AAACGAGCAT GAACGGAAAA TCCTGTTTTC TTTTTTTCTG 
CTCCTTAATT ATGCCTTTCC GGCCTTCTCC CAGTCTGCCA TTACCGTGCA GGGTAAAATT
ACAGGTGCCA AAGGGGAGGC CCTTTCTCAG GTCACTGTTG TTGTAAAGGG AACAAAGGCC
AGTACCAGTA CAAACCCTGA TGGCAATTAT GCCATCCAGG TGCCTGGCAA CAATGCAGTT
CTCGTATTCA GTTCAATGGG CTTTTTGAAA CAGGAGATTC CGGTAAACGG CCGGAAACTG
ATCAATATTG AACTGATGCA GGATGTTAAA GCGCTGGAAG ATGTGGTGGT GATCGGTTAT
GGTATAGCCC GGAAAAGCGA TCTTACCGGT TCCGTATCTT CTATAAAAGC TGAAGACCTT
AAAAAAACAC AGGTAACTTC CTTTGATCAG GCCCTGCAGG GGCGTGCTGC AGGGGTGCAG
GTAACCCAGA TTTCGGGTAA GCCAGGTGCT GAAACTTCCA TCAGGATACG GGGCACCAGT
TCTATCAATG CCGGAAATGA ACCTTTATAT GTAATTGACG GGATGCTGGT GAGCAGCGAT
GGTGCAGATA TGAGTACCGG TGCAACCCGG GGCCCCAGGA TCAGTCCGCT GTCCTCCATC
AATCCCAGTG ATATCGAATC CATAGAAATA TTGAAAGATG CTTCTGCAAC GGCGATATAT
GGTTCAAGAG GTGCGAACGG TGTAGTGCTG ATCACGACTA AAAGGGGCCG TACTGGTGCC
GGCATTGTCA CTTTTGAAAG CTATTATGGC ATTCAGGAAA TCTCTCACAA AGTAGAGGTG
CTGGATGCCG AACAATTTGC CAACCTGGTG AACGAAGCAA AGCTGAATTC CAACGCAACG
CCCATATATG TAAATCCTAA AAACCTGGGC AAGGGAACCG ACTGGCAGGA TGAGCTGCTA
CGTACAGCAC CTATGGCCAG CTATCAGCTG TCGTTTTCAG GGGGCGATGA AAAAACAAAA
TATGCGGTAT CTGGTGGTTA CTTTACACAG AAGGGTATCA TTTTAAATTC TGATTTTAAA
AGATATGCAT TCCGGGCCAA TATGGACCGT GAGGTGAGTA GCAGACTGAC CATTGGCAAC
AGTTTAAGTT TCTCAAGAAT AGCTTCAACA GGGGTGCTGA CCAATTCAGG AACCATTGTT
CCGGGGGTAA CCAGCGCTGC GCTTTTGTTC AATCCGGTAT TGCCTGTTTA TGATGCCTCG
GTGGCAGGTG GTTATACCTT CGAGAACGAC AGGGGTAAAA CACTTGGCAA TCCTATTGCA
GAAGCTAAAG AATACAATTC TTACAGTACC ATAGCGCGGT TGCTGGGTAA TGTATACGCC
AAATACAAGA TTGTTGAGGG ACTCGATTTT AAGACCAGCT TTGGTATAGA TCAGTTTACT
TCCAAGGAAA ATGCTTTCGG ACCAAACTTC CTGAAAAGAA CACAGGCCAG TAAAGGTGAA
GCTTCTGTTG GTGATATCTC TGGGTTAACC TGGCTAAATG AAAATACACT TACCTATCAC
AAGACGATTA AGGAAGATCA TGTATTTGAT ATTCTGGGCG GTTTTACCAT GCAGCGTTTC
AATAACGAAA GCCTGTTTGC CTATGCGTTT GATTTCCCTG ATAACCTGAC GGGCTATCAT
AATCTGGGTA CTGCCCTGAA CCCGCAGAAG ACAACCAATA ATGAATCTCA ATGGAGTTTG
ATCTCTTACC TGGGCAGGAT CAATTACAGC CTGAAAAACA AATACCTGTT TACAGCAACA
GGTCGGATAG ATGGTTCTTC CAAATTTGCG GAAGGAAAAA AATATGGTTT CTTCCCATCG
GGAGCGTTTG CCTGGAAGGT GGTGGAAGAG GACTTTATGA AAACAGTAAA ACAGCTCTCT
GACCTGAAAC TAAGAGTAAG TTACGGACTG ATCGGCAACC AGAATATTGC GCCCTATCAG
TCGCTCGCCC TGGTGGGCCC TTATGGAGAG GGTGTTTTTA ACGGTTCGGA GATCTATACC
GGACGTGAGC CGCTTACCTA CGTCAATAAA AACCTGAAAT GGGAGAGTAC ACGTCAGTTT
GATATAGGGA TGGATGTGGC CTTTTTTGAC AACCGCATTG CGCTTACTGC CGATTATTAC
CATAAAAAAA CGAATGACCT GTTGCTGTCG TCGCCCATTC CGCTTACTTC CGGCTTTACT
TCAACCCTGT TAAATATTGG AAACATCGTA AACAGAGGCT TTGATTTTGA CCTGCGTACG
GTAAATACTA CCGGGGCATT AAAATGGAAC AGCTCTGTCA ATTTTTCTAT CAACAGAAAT
GAGATAACCA GTCTGGCCAA TAAAAATGCT GATATCCTTT CTTCCGGTAG TCTGCTACGT
GTAGGGCAGC CCGTAGGTAC ATTTTACGGA TATATCTTTG AAGGCATCTT CCAGTCTGAC
GCAGAAGCTG CAGGCAGCCC TGTGTTAAAA GGACAGGAAG CCAATTCGGC CAATGTGGCC
TCGAGGGCTA AGGCTGGCGA CAGAAAATAC CGTGACATCA ACAAAGACGG GGTGATCGAT
GAGGGTGACC GGACGATCAT CGGCAGTGCG CAGCCTGATT TTACCTGGGG ATTTAACAAC
ACGCTTTCCT TTAAAAATAT AGATCTGAGT TTCTTTTTTC AAGGTTCTCA GGGCAATAAA
ATGGCCAATC TCAACTCCTA CGACCTGCTG AACTTTAACG GGCAGACCAA TGTGCTGAAA
GAAGGGGGGC TGAACAGATG GACACCTGAA AACCACAGCA ACAAATACCC GAGGGCGGTA
TCCGAAGGCA GTCTGGACCA GGGCGTATTT TCTACTGCCA TTGTGGAAGA TGCTTCTTAT
ATCAGGTTAA GAAATGTGAC GCTGGCCTAT AACCTGCCTA AAAGCTGGGT ACAGAAAATA
AAACTGAACA ACGTAAGGGT GTATGCCAGT GCAACAAATC TCTGGACACA TACCAAATAC
AGTGGCTACG ACCCTGAGGC CAATACTTTT GGACAGAACA GCTTTGTAAT TGGCTACGAC
CAGGGCGGTT ACCCCATTGC AAAAACATAC AGTCTTGGAA TTAACGTTGG TTTTTAA
 
Protein sequence
MMCIRFINEH ERKILFSFFL LLNYAFPAFS QSAITVQGKI TGAKGEALSQ VTVVVKGTKA 
STSTNPDGNY AIQVPGNNAV LVFSSMGFLK QEIPVNGRKL INIELMQDVK ALEDVVVIGY
GIARKSDLTG SVSSIKAEDL KKTQVTSFDQ ALQGRAAGVQ VTQISGKPGA ETSIRIRGTS
SINAGNEPLY VIDGMLVSSD GADMSTGATR GPRISPLSSI NPSDIESIEI LKDASATAIY
GSRGANGVVL ITTKRGRTGA GIVTFESYYG IQEISHKVEV LDAEQFANLV NEAKLNSNAT
PIYVNPKNLG KGTDWQDELL RTAPMASYQL SFSGGDEKTK YAVSGGYFTQ KGIILNSDFK
RYAFRANMDR EVSSRLTIGN SLSFSRIAST GVLTNSGTIV PGVTSAALLF NPVLPVYDAS
VAGGYTFEND RGKTLGNPIA EAKEYNSYST IARLLGNVYA KYKIVEGLDF KTSFGIDQFT
SKENAFGPNF LKRTQASKGE ASVGDISGLT WLNENTLTYH KTIKEDHVFD ILGGFTMQRF
NNESLFAYAF DFPDNLTGYH NLGTALNPQK TTNNESQWSL ISYLGRINYS LKNKYLFTAT
GRIDGSSKFA EGKKYGFFPS GAFAWKVVEE DFMKTVKQLS DLKLRVSYGL IGNQNIAPYQ
SLALVGPYGE GVFNGSEIYT GREPLTYVNK NLKWESTRQF DIGMDVAFFD NRIALTADYY
HKKTNDLLLS SPIPLTSGFT STLLNIGNIV NRGFDFDLRT VNTTGALKWN SSVNFSINRN
EITSLANKNA DILSSGSLLR VGQPVGTFYG YIFEGIFQSD AEAAGSPVLK GQEANSANVA
SRAKAGDRKY RDINKDGVID EGDRTIIGSA QPDFTWGFNN TLSFKNIDLS FFFQGSQGNK
MANLNSYDLL NFNGQTNVLK EGGLNRWTPE NHSNKYPRAV SEGSLDQGVF STAIVEDASY
IRLRNVTLAY NLPKSWVQKI KLNNVRVYAS ATNLWTHTKY SGYDPEANTF GQNSFVIGYD
QGGYPIAKTY SLGINVGF