Gene Phep_2026 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_2026 
Symbol 
ID8253130 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp2335069 
End bp2336331 
Gene Length1263 bp 
Protein Length420 aa 
Translation table11 
GC content41% 
IMG OID644935674 
ProductABC transporter related 
Protein accessionYP_003092293 
Protein GI255531921 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1134] ABC-type polysaccharide/polyol phosphate transport system, ATPase component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.476742 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.0046378 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGTAACC TGATGCTAAA AGCAGAAAAC ATATCAAAAT ATTACCGACT GGGCGTAATA 
GGATCCGGAT CATTTAAAGA AGATCTCCAA AACCTCTGGA AGAAAACATT TTCAATTGGC
AATAATACAG CAACGAACGA CCTCCATATC GACAAAGGAA AAGAACTATG GGCGCTTAAA
GACATTAATT TTGAGATAAT GAAGGGCGAC GTGGTCGGCT TTGTTGGCAA AAACGGAGCT
GGAAAATCAA CCCTGTTGAA AGTACTGTCG CGCATTACGC TACCAACAAC CGGCACCATT
AAAGGCAAGG GCCGCATTGC AAGTTTATTG GAAGTTGGCA CAGGCTTCCA TTTAGAGTTG
ACAGGCCGCG AGAACATCTT TTTAAATGGG CAAATTTTGG GTATGCACAA AAAAGAGATC
ATAGCTAAAT ATGACGAAAT TGTAGCTTTT TCCGGTATTG AGCGATTTCT GGATACTCCT
GTAAAACGCT ACTCCAGCGG GATGTACGTT CGGCTTGCAT TTGCCATAGC TGCGCATTTG
GATCCCGAAA TCCTCATCGT TGATGAGGTA TTAGCTGTGG GTGATGCTGA ATTTCAGAAA
AAATGCCTGG GTAAAATGAA ACAGGTGTCG ACAGAAGAAG GCAAAACTGT ACTTTTTGTA
AGCCATAATT CGCAAGCGCT AAAAAGCTTA TGTACCAAAG CAATCTACCT TGAAAAAGGG
CGGCTAATTG ATATGGGAAA TATGCAAGAT GTAATAGGCA ATTATTTAAA GCGTGAGCAA
ACATTATATC TAAGCAGGAT ATACGATGAC CCTGACACTG CGCCAGGAAA TGAAAGTGTC
CGTATCAAAC GTGTCGAAAT GTTGCCACAA TATCCCGATT CCAGCAATAT CATAGACATC
AGAACACCTC TGCTCATCGA ATTCGAATTT TGGTATTTAC CAGCGGAAGA AATGGATCTG
GGTGTAAACA TTATATTAAA CACCGTAATG GGAGAATGTG TTTTTAATGT CGCCTCAACT
TCAAAGCAAT ATACCAAGGG GGTAATTAAA GGGAAATGTA CTATACCTGG CGACTTTCTG
AATAATGGGT CCTATTCCAT AGACCTGTCA TTTGTTAAAA ACACCAGCAG TCCATTGTTT
GATTTTGAAG AATGCTTATC TTTTGAAGTG GAGGACTTTA GGGAGAATAC GGCATGGTAT
GGCGACTGGG TTGGCTCGGT TAGGCCAAAG TTTAAAGTAC AACTGCAACA AGACAACTTT
TAA
 
Protein sequence
MSNLMLKAEN ISKYYRLGVI GSGSFKEDLQ NLWKKTFSIG NNTATNDLHI DKGKELWALK 
DINFEIMKGD VVGFVGKNGA GKSTLLKVLS RITLPTTGTI KGKGRIASLL EVGTGFHLEL
TGRENIFLNG QILGMHKKEI IAKYDEIVAF SGIERFLDTP VKRYSSGMYV RLAFAIAAHL
DPEILIVDEV LAVGDAEFQK KCLGKMKQVS TEEGKTVLFV SHNSQALKSL CTKAIYLEKG
RLIDMGNMQD VIGNYLKREQ TLYLSRIYDD PDTAPGNESV RIKRVEMLPQ YPDSSNIIDI
RTPLLIEFEF WYLPAEEMDL GVNIILNTVM GECVFNVAST SKQYTKGVIK GKCTIPGDFL
NNGSYSIDLS FVKNTSSPLF DFEECLSFEV EDFRENTAWY GDWVGSVRPK FKVQLQQDNF