Gene Phep_1918 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_1918 
Symbol 
ID8253022 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp2218415 
End bp2220478 
Gene Length2064 bp 
Protein Length687 aa 
Translation table11 
GC content43% 
IMG OID644935569 
ProductTonB-dependent receptor 
Protein accessionYP_003092188 
Protein GI255531816 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG4206] Outer membrane cobalamin receptor protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.138807 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.178557 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATAAGAA AATCAAGTGC CATACGTGGC ATGAATTGTC GTTTCTCTGT TTTTTTTAAA 
GCTATACTTT GGCTGGGGAT TTTTATGCCC CTTACTTTGT TTGCCCAGGA GGATGTGGTT
AAGGCTAAAA AAGCCGATTC TATTGATAAG GTAAACCAAC TTAAAGAAGT ACAGATCAGA
ACCATAAAGA TTGGCAGGAG GCAAACTTCA TCTACACCTT TACAAATCCT TTCGGGCGAA
GAGTTGCAAC GCCTGAACAG TTTATCGGTA GCCGATGCCG TCAGGTATTT TTCGGGGGTA
CAGTTAAAAG ATTATGGTGG CATTGGTGGC TTAAAAACCA TCAATGTGCG CAGCATGGGT
ACCAACCATA CTGCTGTTTT TTACGATGGG GTACAATTGG GCAATGCACA GAACGGGCAG
GTCGACCTGG GAAAATTTTC ATTAGACAAC ATAGCCGAAA TTGAGCTGTA TAACGGGCAA
AAGAGTACCA TATTTCAATC GGCAAAAGGC TTTGCTGCCG GAAGTTCATT GTATATAAAC
TCAACGCAGC CTGATTTTGA AGATGGGCGC AGTGATAAAT GGAAAGCCAC ATTAAAAGGT
GGTTCATTTG GCCTGATAGA TCCTTCTGTG CTTTGGCAGC ATAAAATCAG CGAAAGCATA
TACAGTTCCT TAAGTGCCGA ATGGAAAAAA GCAAGTGGCC GTTACAAGTT CAGGGTAAGG
AATTACGGTT ATGATACCAC AGCAGTAAGG GAAGATGGTG GGATTGAGAC CATGCGGCTT
GAGCTTGGAC TGAACGGGGT ATTGCCGGAT AGCAGTCTGT GGACGGTTAA GTTGTATGGA
TACAATGACA AACAGGGCTT GCCAGGGGCC ATAGTAAATA ATGTATGGCG TTTTCCGCAG
AGAATGTGGA ACAGAAACTT TTTTGCGCAG TCGACCTTCA AAAAAGACCT GCACAAATAT
CATCTTTTGG CTGCTGTTAA ATATGCTAAC GATTATACCA AATATCTTGA TCCGAACTAT
GTAAAAGATA CTGGCTTTCT GACAAATATC TATAAACAGC AGGAGCTGTA TTTTTCGCTG
GCCAACCGTT ACCAGCTCAG TTCATTCTGG GATATCGTTT TGTCGGGCGA TTACCAATGG
AACACTTTGG ACGCCAATAT GGATCGTTTT CCATATCCTA CCAGGTATAC GGGCCTGCTG
GCCCTGGCTA CCGAAATTCA TTTGGACAGG TTAAATATTC AGGCCAATTT GCTGGGCACA
CTGGTTAATG ATGAGGTAGA ACGCTATTTT TCTGCGGGTA ATAAAAGGGA ACTTAAGCCC
TCTGTAATGG TTTCCTGGCA GCCTTTCAGC TTAAAGGAAT TTCGTTTACG CGGATTTTAT
AAGGATATAT TCAGGATGCC TACTTTTAAT GACCTGTATT ATACCCTATT GGGGAACACT
TTCCTGAAAC CGGAATATGC AAAACAATAC GATCTTGGAT TTACCTATAT CAGGTTGATT
GACAATCATT TGCTGAACCA GATCAGTATA CAGTCGGATG TATATTACAA CAAGATCAGA
AATAAGATTG TTGCAGTTCC AGGGGCCAAT TTGTTTACCT GGAGCATGCA GAACCTGGGG
TTGGTAGAGA TCAGGGGCCT GGATGTGAAT ATACAGACCG GCTGGCGTAT TGCCGATCAG
CTGATGGTCA ACACCGGTAT TACCTATACC TATCAGAAGG CTTTGAACAT GACAGATGTG
AACCTGAACT ATAAGAACCA GATCCCATAT ATTCCTGTAC ATAGTGGTTC CTTTACTGCT
GGTGCCGACT GGAGAAACCT GGGTCTGAAC TACAGTTATA TCTATACGGG CGAGCGTTAT
GATCAAAGTG CCAATATTAT TGAAAATTAT GTTCAGCCCT GGTACACCCA TGATATTGCC
TTTCATTACC ATAAAGATCT AAAACATGCC CGTGTTAAAG TATCGGTCGA GGTAAACAAC
CTTTTGAATC AAGATTATGA AGTGGTTACC AATTTTCCAA TGCCGGGCCG GTCGTATCGT
TTCACCTTAT CTTATGCTTA CTGA
 
Protein sequence
MIRKSSAIRG MNCRFSVFFK AILWLGIFMP LTLFAQEDVV KAKKADSIDK VNQLKEVQIR 
TIKIGRRQTS STPLQILSGE ELQRLNSLSV ADAVRYFSGV QLKDYGGIGG LKTINVRSMG
TNHTAVFYDG VQLGNAQNGQ VDLGKFSLDN IAEIELYNGQ KSTIFQSAKG FAAGSSLYIN
STQPDFEDGR SDKWKATLKG GSFGLIDPSV LWQHKISESI YSSLSAEWKK ASGRYKFRVR
NYGYDTTAVR EDGGIETMRL ELGLNGVLPD SSLWTVKLYG YNDKQGLPGA IVNNVWRFPQ
RMWNRNFFAQ STFKKDLHKY HLLAAVKYAN DYTKYLDPNY VKDTGFLTNI YKQQELYFSL
ANRYQLSSFW DIVLSGDYQW NTLDANMDRF PYPTRYTGLL ALATEIHLDR LNIQANLLGT
LVNDEVERYF SAGNKRELKP SVMVSWQPFS LKEFRLRGFY KDIFRMPTFN DLYYTLLGNT
FLKPEYAKQY DLGFTYIRLI DNHLLNQISI QSDVYYNKIR NKIVAVPGAN LFTWSMQNLG
LVEIRGLDVN IQTGWRIADQ LMVNTGITYT YQKALNMTDV NLNYKNQIPY IPVHSGSFTA
GADWRNLGLN YSYIYTGERY DQSANIIENY VQPWYTHDIA FHYHKDLKHA RVKVSVEVNN
LLNQDYEVVT NFPMPGRSYR FTLSYAY