Gene Phep_3766 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_3766 
Symbol 
ID8254898 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp4511424 
End bp4512902 
Gene Length1479 bp 
Protein Length492 aa 
Translation table11 
GC content42% 
IMG OID644937428 
ProductABC transporter related 
Protein accessionYP_003094019 
Protein GI255533647 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1129] ABC-type sugar transport system, ATPase component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0098796 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTACAAC TAAAAAATAT CAGAAAATCG TTTGGAGGCG TTCATGCACT CAAGGGAGTG 
GAGCTCTGTG TTAAAGATGG AGAAATTCAT GCTTTATTGG GCGAAAATGG CGCTGGGAAG
TCGACCTTAA TGAAGATCAT TTCGGGTGCG CATATTGCTG ATGAGGGAGA GATTTTTTTC
AATGATAGTA AAATCAAGCA TAATTCCCCT CATGTAGCCC AGCAACACGG GATCAGTATC
ATTTACCAGG AGTTTTCCCT GGTTCCTGAT CTTTCCGTTA CCGAAAATCT TTTTTTAAGC
AGGTTTGCAA AATCCTCCTG GATAAATTGG GAGCGTATGC ATAAAGAAGC GGCATCACTG
ATCAACAGTT TAGGTTTCGA TATGGATGTG AAATCTATTG TCAGAACGCT GACGGTTGCA
CAGCAGCAAA TTGTGGAAAT TGCAAAAGCA TTATCTCAAA ATGTTAAATT ACTAATACTG
GACGAGCCTT CCTCTGTGTT GGGGCCTAAG GAGGTGAAAA GATTGTTTGA GATGCTGAAA
GGTTTGAAAG CCAAAGGTGT ATCTATAATT TATATATCCC ATCATTTAGA AGAACTACTG
GTGCTTACCG ATAAGATCAC TGTGCTTAAA GATGGAAAAA CCATAGAAAC AGTGGAGACG
GATACCGTTG ACAAAGATCG GCTGGTATCC CTCATGGTGG GAAGAGAACT GGGGCAGATG
TATCCGCAAA AAAATAAGGC TATTGATCAG CAAAGTAAGG TCGAGATCAA AAGCTTGTCT
ACCAGGTTCA CAAAAGAACC TTTGAGTTTT GATATCCATA GAGGAGAAAT TGTAGGGATA
GGCGGACTTG TAGGGTCGGG CCGTACAGAA GTACTGGAGT CTTTATTTGG ATCGGACCGG
ATCCATGCCA ATGAGATTTT ATTTGGTGGG AGGGTATGGA GTTTCAAAAG ACCAGAGCAG
GCCATCCGTC AGGGATGGGG AATGCTTCCT GAAGACCGTA AAAAAAGCGG AGGTGTCTTG
GATTTGAGTA TCAAACAAAA TATTTCGCTG GCCAACTTAG GTAAGATTGC CAACTCCTGG
GGATTTATCA ATGAAACGAA GGAAAATGAA ATAGTGGGAA GGTTGATCAA AAAACTCAGG
ATTAAAGTTG GCGACGCAAC ACATGCTTTG TCTACTCTGA GTGGGGGAAA TCAGCAGAAA
GTTATTCTGG GCAAATGGCT CAATCTGGAT TTAAAGGTAT TGTTGATTGA TGAACCTACC
CGGGGTGTTG ATGTAGGCGC GAGATCTGAA ATATACCATA TCATCCAAAA GTTAGCCGAT
GATGGGGTTT TTGTGCTGAT GGTCTCCTCT GATATGGATG AATTGATGGG CCTTTCTGAT
CGGATCCTTG TATTTAAAAA TGGTACGCTA CAGGGAGAAG TTTTACGTCC GGATTTTAGT
GAAGAAGCAG TTTTAAGAAT GGCTATTGGT GCCAAATAA
 
Protein sequence
MLQLKNIRKS FGGVHALKGV ELCVKDGEIH ALLGENGAGK STLMKIISGA HIADEGEIFF 
NDSKIKHNSP HVAQQHGISI IYQEFSLVPD LSVTENLFLS RFAKSSWINW ERMHKEAASL
INSLGFDMDV KSIVRTLTVA QQQIVEIAKA LSQNVKLLIL DEPSSVLGPK EVKRLFEMLK
GLKAKGVSII YISHHLEELL VLTDKITVLK DGKTIETVET DTVDKDRLVS LMVGRELGQM
YPQKNKAIDQ QSKVEIKSLS TRFTKEPLSF DIHRGEIVGI GGLVGSGRTE VLESLFGSDR
IHANEILFGG RVWSFKRPEQ AIRQGWGMLP EDRKKSGGVL DLSIKQNISL ANLGKIANSW
GFINETKENE IVGRLIKKLR IKVGDATHAL STLSGGNQQK VILGKWLNLD LKVLLIDEPT
RGVDVGARSE IYHIIQKLAD DGVFVLMVSS DMDELMGLSD RILVFKNGTL QGEVLRPDFS
EEAVLRMAIG AK