Gene Phep_3633 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_3633 
Symbol 
ID8254764 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp4343017 
End bp4346049 
Gene Length3033 bp 
Protein Length1010 aa 
Translation table11 
GC content44% 
IMG OID644937294 
ProductTonB-dependent receptor 
Protein accessionYP_003093886 
Protein GI255533514 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAGAA TCTTTACAAA ACTTTCAGTT TTAACTTTTC TCTGTTTTCT TTTTACGAGC 
GTAACACAAG CACAGGACCT TACTGTAACT GGTGTAGTAA CAGATTTAGC AGATAAACTG
CCGCTACCTG GTGTCAGTGT ACAGGTAAAA GGAACCCAGA AAGGAACAAC CACAGATGCA
ATGGGCAAAT ATGCCATCAG CGCACCTGCA AATGCAACCC TGGTTTTCAC ATCCATTGGG
TATACCAGCC GTGAAATGCA GATCGGCAAT CAAACCACAA TTAATGTCGT GCTTTCCTCT
GCCTCCCAAG ACCTTGAAGG TGTAGTTGTA GTAGGTTATG GCACGCAAAG AAAACGTGAT
CTAACAGGAG CAATTACACA GATCAAAGGC GACGAAGTGG CAAAAATGCC AAACACCAAT
CCTCTATCTT CTTTACAGGG TAAAGTAGCC GGTTTAACCG TTGTAAACAG TGGCACACCC
GGTGCCGCAC CTACCGTAAG GATCAGGGGG GTAAACAGCA CAACATCAGG AGGCAACAAT
CCTCTTTATG TAGTTGACGG CGTGCATCAG GACAATATCG ATTACATCAA TCCCGCAGAT
ATAGAAAGCA TTGAAGTATT AAAAGACCCT TCTTCTATAG CTATTTTTGG TTTGCAGGGA
GGAAACGGTG TGATTGTAGT TACCACAAAA CGGGCAGCTA AAGGACAAAC CACCATTAAC
CTTCAAAGTT CTGCCGGTGT ACAAAAAGTA TTGAATACCA TTGATGTAAC CAATGCAGCA
CAATTCATTA AATTGTACAA CAACCTACTC GCTAATTCCG GATTGGGATC TTACGATTAT
ACCAATTATA CTGCAGATAC CGATTGGCAG AAAGAAATCC TCCAATCAGC CTTTCAAAGC
AACAATAACC TGAGCATTTC CAATAGCGGT GAAAAATCAA CCACATTGAT CAACCTGGGA
TACAATACAA TGGAAGGTGT AGTTAAATTT GGAAAGTATC AAAGATATGT AGCCCGGGTA
AACGAGGAAA TAAGAATAAA TGATAACATT AAAATTGGCG CAGATATAAC AGGAACACAC
TGGATTCTGA ACAACTCGAG CGGAGATCTG AACAATGCAT TATGGGCAGC CCCGATAGTT
GGCATCAGGG AAAGTGAAAC TGCTTATTAT GCCATGCCTG GATTTCAGCG GGGCCAGGTT
GGTAATCCGG TAGCCAGGAT TTACCAAAAC GACAGAAACA GCATTAACAA AGGTTACCGG
GTAGTCGGAA ACCTTTTCGC CGATGTTACT TTCCTGAAAA AATTTAAATG GAGGTCGCAG
TTTTATACCG ACCTTGGTTT CAACAATAAC CGGGGATATA CAGGCCTGCC CTTTACTGTT
ATCTATTTAG GAGAAGGCAA TATCCCTACC ACCAGATTCG ACAATCCAAA TGCCCGTACT
TCGGTAAGAC AGGGAGCTGA CGAATTCAGA AAGTTTCAGC AGGACCACAC AATAACTTAT
GAAAACACCT TTAATAACGA CCATAAGGTA ACAGCTGTTG CAGGTTTTAC TTCTATCTTC
AGAAGTGAGA CTAAACTTTC CGGGGACCGG ACAGACATCG GGCTAAATAT TCCAGATGAC
CCTGCTTACT GGTACATTGG CATTGCCGAC AAAAGCAATC CAAGTGGTGT AACCGGTAGT
GGTGAAGAAA GGGCCTCAAT GGGTTATTTT GCAAGGGTAA ACTATGCTTA TAAAGACAGA
TACCTGATAA ATGCCACTTA TAGAAGAGAC GGGCTGTCCA GTATTGCCCC TCAAAACCGC
TGGGGTAATT TTGGCGGTAT TGGTTTGGGA TGGGTGCTCT CCGAAGAAAG CTTCTTCAAA
AACATTAAAG GTGTTGACTT TTTGAAACTT CGTGGTTCAT GGGGAACAAC AGGTAACGGA
CAGGGCTTAC CTCCCAATAT CTTCAGACCA GGTGTTACCA CCTCTGGCTC GGGCGTGTTC
GGCGACAACA TTTATCCCGG CATTGCACCT GCCTATATTG CAGATCCAAA CCTGAAATGG
GAAGTAGTAA GAGGATTAGA CTTGGGAATG GATCTGAAAG CTTTGAACAG CCGCTTAAGC
GCAGAAATTA ATGTATATGA CCGTACAACA AAAGACATCA TTACCCAGAT CACTTTATTG
AATACTAGCG GAAGCTATCC TTACCGTACC AATCTGGGCA CTATATCCAA TAAAGGAATT
GAGGTGGCCC TGGGGTGGAA TGATAAAATC GGCAGTGATT TCACCTATAA TATCACACCT
AATTTTAGTT ACAATAAAAA CGAAGTCGTA TCCATCGGAA ATAACATTAA CTTCCTGCTA
ACCGGAAATG GCGGTGCAAA CAGGACCATT ACCGGAGAAT CGATAGGTCA CTTTTATGGA
TATAAGCAAA TAGGCATTTA TCAATCGACT GCCGATCTGG ACAAAATGGC CAGGCTATCC
AACTCACTCC CTGGTGATAT TGCCTATCAG GATACAGATG GTGATGGTAA GATCACACCT
GCCGACCGGA TTAAGCTAGG TTCTCCCTTT CCTGCCTGGA GTTATGGCCT GAACCTGAAC
TTAGGCTACA AAGGTTTCGA TGTTTTGTTA CAGGGACAAG GTGTTGCCGG CAATAAAGTA
TACACCCAGC GCCGTACCGC AACTTTTGCT GACCTGAACT TTGAAACCAA CAGATTAAAT
GCCTGGACAG GGCCAGGAAC CAGTAACGTT GAACCCATTT TACAGAAAGG CAGACTCAAC
AACTACCTGT TCAGCAGCTA TTACCTGGAG CCGGGAGATT ACTTCCGTCT GCGTACCGTA
CAATTGGGTT ATACCTTTAA ACCAGCTATG CTGGCAAAAG CAGGTGTTAA AAACCTCCGT
TTGTATGTGA GTGGACAGAA CATCCATACC TGGACAAAAA CTACCGGATA TTCGCCTGAA
GCACCGATCA GTGATGTACT TGGTGGTGGT GCTGACAATG GAGTATATCC AATTCCGGCA
GTTTACACAT TTGGTATCAA TGCAACATTT TAA
 
Protein sequence
MKRIFTKLSV LTFLCFLFTS VTQAQDLTVT GVVTDLADKL PLPGVSVQVK GTQKGTTTDA 
MGKYAISAPA NATLVFTSIG YTSREMQIGN QTTINVVLSS ASQDLEGVVV VGYGTQRKRD
LTGAITQIKG DEVAKMPNTN PLSSLQGKVA GLTVVNSGTP GAAPTVRIRG VNSTTSGGNN
PLYVVDGVHQ DNIDYINPAD IESIEVLKDP SSIAIFGLQG GNGVIVVTTK RAAKGQTTIN
LQSSAGVQKV LNTIDVTNAA QFIKLYNNLL ANSGLGSYDY TNYTADTDWQ KEILQSAFQS
NNNLSISNSG EKSTTLINLG YNTMEGVVKF GKYQRYVARV NEEIRINDNI KIGADITGTH
WILNNSSGDL NNALWAAPIV GIRESETAYY AMPGFQRGQV GNPVARIYQN DRNSINKGYR
VVGNLFADVT FLKKFKWRSQ FYTDLGFNNN RGYTGLPFTV IYLGEGNIPT TRFDNPNART
SVRQGADEFR KFQQDHTITY ENTFNNDHKV TAVAGFTSIF RSETKLSGDR TDIGLNIPDD
PAYWYIGIAD KSNPSGVTGS GEERASMGYF ARVNYAYKDR YLINATYRRD GLSSIAPQNR
WGNFGGIGLG WVLSEESFFK NIKGVDFLKL RGSWGTTGNG QGLPPNIFRP GVTTSGSGVF
GDNIYPGIAP AYIADPNLKW EVVRGLDLGM DLKALNSRLS AEINVYDRTT KDIITQITLL
NTSGSYPYRT NLGTISNKGI EVALGWNDKI GSDFTYNITP NFSYNKNEVV SIGNNINFLL
TGNGGANRTI TGESIGHFYG YKQIGIYQST ADLDKMARLS NSLPGDIAYQ DTDGDGKITP
ADRIKLGSPF PAWSYGLNLN LGYKGFDVLL QGQGVAGNKV YTQRRTATFA DLNFETNRLN
AWTGPGTSNV EPILQKGRLN NYLFSSYYLE PGDYFRLRTV QLGYTFKPAM LAKAGVKNLR
LYVSGQNIHT WTKTTGYSPE APISDVLGGG ADNGVYPIPA VYTFGINATF