Gene Phep_4183 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_4183 
Symbol 
ID8255318 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp5062084 
End bp5063790 
Gene Length1707 bp 
Protein Length568 aa 
Translation table11 
GC content42% 
IMG OID644937848 
Productglycoside hydrolase family 3 domain protein 
Protein accessionYP_003094436 
Protein GI255534064 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1472] Beta-glucosidase-related glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.101852 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATGTTA AGCATTTTTT TATAGCGACT TTACTTTTAA TAACAACAGT ACTTACTGCA 
CAAGGCCAAA CAAAAACATA CTTACAAACA CTGCAGGAAC CCAACCACTG GGTAGATTCT
GTATTTAAAA AACTGAGCAA ACGACAAAAA ATTGCCCAGC TTTTTTTTGT CAGGGCGCAT
ACCAATTTTG GGAAAGCTTT TGAAGATTCT ATAGGTAAAG TAATTAAAAA AGAACGCGTT
GGTGGTCTGG TATTTTTCCA GGGTGGTCCG GGCAGGCAAG CCATTTTAAC CAATACCTAT
CAGTCACTGG CCAGGGTACC TTTATTGATT ACTTCTGATG GGGAGTGGGG ATTGGGCATG
CGTTTAGACA GTACCATTTC TTATCCTTAC CAGATGGCGT TAGGAGCAGT ACAGAATAAA
GACCTGCTGT ATAAAATGGG CCTTGAAGTT GCCAGAGATT ATAAAAGAAT TGGTATGCAC
ATGAACTTGG CACCTGATGT AGACGTAAAT AATAACCCTA AAAATCCAGT GATCAATTTC
CGTTCTTTTG GCGAAAACAA ATACAATGTA GCTACAAAAG CGGCTGCGTA TATGAAGGGC
ATGCAGGATG GCGGCCTATT GGTGAGTATT AAACATTTTC CAGGACATGG AGATACGGAT
GTAGATTCGC ACTACGACCT GCCCCAGCTA AATTTTACCC GGGCACGTCT GGATAGCCTG
GAGATCTATC CCTTCAGAGA ACTGATCCGG GAAGGTGCTG CAGGTGTAAT GATTGCACAC
ATGAACATTC CGGCATTGGA CAATACCCCA AATATGCCTT CTACTTTATC TAAACCAATT
GTAACCGGAT TGCTTAAAGA AGAACTTGGG TTTAAAGGAA TTGTGATCTC GGATGCGATG
GGTATGAAAG GGGTGGTAAA AAATTTTAAA GATGGCGAAG CTGATGTAAT GGGCATTATT
GCCGGCAACG ATATTTTAGA GTTGTCTGAA AACAGTGCAA GAGCCATAAA ACTGGTACGT
AAGGCCGTAA GAGCAGACAG GATCAGTATG GAACAGATCG ACGCAAGTGT AAAAAAAATC
CTTACAGCTA AATATTGGGC AGGTTTAAAT GTTAAACAGA AAGTTGATGA AAATAATGTC
GTAGCCGGGG TGAACAGACC AGAAAGCCTG GCATTGCTGC AACAGCTGGC CGATGCTTCT
ATGACAGTTT TAACGGGTAA AGACAATATC AAACAGCTAA GTGCAGAGAA AAGGACAGTT
ATCATTAGCG TCGGTACACC AGAGGTTACC ACCTTTCAGC GCGAACTGGG TACTTATTAT
AAAAATTCGG TTTTTTATAC GCTGGATAAA AATGCATCAG CCAGCCAGAT TGCAAAGGTG
CTGCGCGAGC TGAAAGCGTT TGACCAGCTG ATAATAGGTA TCCATGATAC GCGTTTACGT
CCGGGTAATG GCATGGTGTT GAGCGCCGAC CTGAAAATAT TTATTAAGGA TATGGCCTTG
AGCAACACTG TATTTGCTTT ATTTGCCAAT CCCTATAATT TGGCCGCCTT ACCTGGTTTA
GAGCAAAGCA AGGGCCTTAT AGTTGCCTAT CAGAAAGAAG ACTTTATGCA GAAGGCTGCA
GCTTCAGTAA TTAAAAATCA GCTGGATGCT ACCGGAAAAC TGCCGGTTAC AGTAAACTCC
TTTTTTAAAT ACGGCGACGG ATTGTAA
 
Protein sequence
MNVKHFFIAT LLLITTVLTA QGQTKTYLQT LQEPNHWVDS VFKKLSKRQK IAQLFFVRAH 
TNFGKAFEDS IGKVIKKERV GGLVFFQGGP GRQAILTNTY QSLARVPLLI TSDGEWGLGM
RLDSTISYPY QMALGAVQNK DLLYKMGLEV ARDYKRIGMH MNLAPDVDVN NNPKNPVINF
RSFGENKYNV ATKAAAYMKG MQDGGLLVSI KHFPGHGDTD VDSHYDLPQL NFTRARLDSL
EIYPFRELIR EGAAGVMIAH MNIPALDNTP NMPSTLSKPI VTGLLKEELG FKGIVISDAM
GMKGVVKNFK DGEADVMGII AGNDILELSE NSARAIKLVR KAVRADRISM EQIDASVKKI
LTAKYWAGLN VKQKVDENNV VAGVNRPESL ALLQQLADAS MTVLTGKDNI KQLSAEKRTV
IISVGTPEVT TFQRELGTYY KNSVFYTLDK NASASQIAKV LRELKAFDQL IIGIHDTRLR
PGNGMVLSAD LKIFIKDMAL SNTVFALFAN PYNLAALPGL EQSKGLIVAY QKEDFMQKAA
ASVIKNQLDA TGKLPVTVNS FFKYGDGL