Gene Phep_3767 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_3767 
Symbol 
ID8254899 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp4512911 
End bp4513942 
Gene Length1032 bp 
Protein Length343 aa 
Translation table11 
GC content45% 
IMG OID644937429 
Productperiplasmic binding protein/LacI transcriptional regulator 
Protein accessionYP_003094020 
Protein GI255533648 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1879] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0706737 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGAAAT TCGGGTATAA AGTATGGTCC ACTTTAGGCT TATCTCTGTT TCTTGTTGTA 
GGGTGTTTGC CTAAAGAAGA TAGATTAAAC AACAAAAAAA AGGAAGGAAA GGAGGTGGTT
AAAGGCTATA CTTCTAAAAA GGATCTCTCG GCAGTAAAAG TGGGTTACAG CGGTCCCTCG
ATGGTTGCCC CCTATTATGT GGCATTGGAA GATGTAGTGA AGCGAAGTGT GCAGCGATAT
GGAATGCAAT ATTATACGGC CGATGGGCAG GAAGATGTAG CTAAGCAGGT TGCTGCTATA
GAAGACCTGT TGTCTAAAGG CATACAGGTG CTGGTGCTAA ACCCGCTGGA CCCAAAGGCT
GTGGTGCCGG TTGTAAACAG GGCAATTGCT GAAGGGGTGG TCGTGTTTAT TGTAGATTCA
ATGATTGATG AAAAAGCCGC TTATACCTCA TCAGTTGTGG CTAATAATAC TTTAAATGGG
GAGTTGCTGG GGCTTTGGCT GGCAGAGACA AAGAACGAGG CCTTAAAGAT CGCGATCATT
AGTGGTAACC AGGGCAATCC CGTGGGACGA GAAAAAAGAC TCGGCTTTGT GAGAGGCCTG
GCAGACGGAC AATTGCGTCA AAATGCGAAA ACCAATTTCG ATATCGTAGC ACAGGGCTGG
GGTGGCTGGA ACAACAACGG GGGACTTAAA GCTATGGAAG ATATCCTGGC CGCACACCCC
TATGTAAATG TTTTACTGGC AGAAAACGAT GCGATGGCCC TGGGTGCCTA CAAAATTATT
AAGCAGATGG GCAAAGAAAA CCAGATCACT ATTTTAGGAT TCGACGGGCA GAAAGAAGCT
TTTGATATGC TCAAAACCGG AAAATTTGAA GCGACTGCAC AAAACAGCCC GAAAATATTG
GGGGAGACGA TCATAGAACT CGTTGCCCGG CATCTTAACG GCGAAAAGGT AAATAAACTG
AATTATACCC CTTCCGTATT GATCAGCAAA AAGAATGTGG ATGAATATTA CGATGCTAAG
GCCTTATTTT AA
 
Protein sequence
MMKFGYKVWS TLGLSLFLVV GCLPKEDRLN NKKKEGKEVV KGYTSKKDLS AVKVGYSGPS 
MVAPYYVALE DVVKRSVQRY GMQYYTADGQ EDVAKQVAAI EDLLSKGIQV LVLNPLDPKA
VVPVVNRAIA EGVVVFIVDS MIDEKAAYTS SVVANNTLNG ELLGLWLAET KNEALKIAII
SGNQGNPVGR EKRLGFVRGL ADGQLRQNAK TNFDIVAQGW GGWNNNGGLK AMEDILAAHP
YVNVLLAEND AMALGAYKII KQMGKENQIT ILGFDGQKEA FDMLKTGKFE ATAQNSPKIL
GETIIELVAR HLNGEKVNKL NYTPSVLISK KNVDEYYDAK ALF