Gene Phep_3395 details

Gene Information

Locus tag: Phep_3395
Symbol:
ID: 8254514
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pedobacter heparinus DSM 2366
Kingdom: Bacteria
Replicon accession: NC_013061
Strand:
Start bp: 4038620
End bp: 4039687
Gene Length: 1068 bp
Protein Length: 355 aa
Translation table: 11
GC content: 47%
IMG OID: 644937047
Product: glycoside hydrolase family 43
Protein accession: YP_003093651
Protein GI: 255533279
COG category: [R] General function prediction only
COG ID: [COG3940] Predicted beta-xylosidase
TIGRFAM ID:
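
The start, end, and length fields above are related by simple arithmetic. The following is a minimal Python sketch of that check, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon; both are assumptions rather than statements made by the record, although they are consistent with the listed values. The function names are hypothetical.

```python
# Coordinates copied from the record above.
START_BP = 4038620
END_BP = 4039687


def gene_length(start, end):
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len):
    """Number of residues, assuming the last codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1068 355
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.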


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 38
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATCCAA CCATAACAAC CAGCAGTTTT TCTTTTCTTG TATTGTTCTG TTTAATTTTA 
GGGCATACAG CAACGCAGGC CCAGGAAAAA AACGAAGCGC GGACCTTCGT AAATCCAGTA
GGGCAGGGCG CAGATCCCTG GGTGATCAGG CACAAAAACC ATTATTACGT TTGTCAGAGC
ACCGGAGGGA TCGCTGGTAG AGGTATTTCT GTAAGTAAAT CGGATAAACT GAGCAAGCTG
GGCAAGGCAG TTATCGTGTG GAATGCACCT CGAAAAGGCT GGAACAGCAA TCAAATCTGG
GCGCCAGAGC TCCACTATTT CAACAACAAA TGGTACATCT ATTATGCGGC AGGCCAATCT
GGCCCTCCGT ATATCTATCA GCGTTCAGGC GTACTCGAAT CAGTAACAGA TGACCCGCAG
GGAAAATATA TAGATAAAGG CCTGTTAAGC ACCGGTGCAG ACCCAAAAGA TCAAACCGGA
AATATCTGGG CCATAGATGT TAATGTAGCA GAAATCAGAG GGAAACTCTA TGCTGTATGG
TCTGGTTGGG AAAAGAATGC GGCTACCGAT AAAACAGTTC AACACCTGTA TCTGGCCAGG
ATGAGCAATC CCTGGACAAT CAGTTCAGAA AGGGTAAAAA TTTCCAGTCC TGACCAGCAA
TGGGAAACAG GCGGTCCACT GAACTTGAAC GAGGGCCCTC AGTTTCTGCT TCGTAAAGGA
CAGGTTTTTA TTGTGTATTC CACCCGCGAA TCCTGGACAC CAGAGTACCG CCTTGGCTTG
TTAAGACTTA AAAAAGAGGC CAAAGTATTG CTTGATGCGA AGAGCTGGGA AAAAACGGGA
CCTGTGTTTC AGGGAACAGA ACAAGTCTTT GGTCCTGGAC ATGCTAGTTT CACGCAATCT
CCGGATAACA AAGAATGGTG GATCTTTTAT CATGCCAAGA AAACTACAGA ACCAGGTTGG
TCGCGGGACA TGCGCTTACA AAAGTTTAGC TGGAACCCGG ATGGGAGCCC AAATTTCGGT
ACACCTATCC CGGCAGGTGT GGCCATTCAG GTACCATCCG GAGAGTAA
 
Protein sequence
MNPTITTSSF SFLVLFCLIL GHTATQAQEK NEARTFVNPV GQGADPWVIR HKNHYYVCQS 
TGGIAGRGIS VSKSDKLSKL GKAVIVWNAP RKGWNSNQIW APELHYFNNK WYIYYAAGQS
GPPYIYQRSG VLESVTDDPQ GKYIDKGLLS TGADPKDQTG NIWAIDVNVA EIRGKLYAVW
SGWEKNAATD KTVQHLYLAR MSNPWTISSE RVKISSPDQQ WETGGPLNLN EGPQFLLRKG
QVFIVYSTRE SWTPEYRLGL LRLKKEAKVL LDAKSWEKTG PVFQGTEQVF GPGHASFTQS
PDNKEWWIFY HAKKTTEPGW SRDMRLQKFS WNPDGSPNFG TPIPAGVAIQ VPSGE
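
The sequences above are displayed in blocks of ten characters, so they need to be joined before any computation. The following is a rough Python sketch of how the gene sequence could be cleaned, its GC fraction computed, and its codons translated using the amino acid assignments of NCBI translation table 11 (the table named in the record). The helper names are hypothetical. The sketch ignores the alternative start codons of table 11, which does not matter here because the gene begins with ATG.

```python
BASES = "TCAG"
# Amino acid assignments for NCBI translation table 11, with codons
# ordered TTT, TTC, TTA, TTG, TCT, ... (first base varies slowest).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)


def clean(seq):
    """Remove the spaces and line breaks used for display grouping."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        index = (16 * BASES.index(codon[0])
                 + 4 * BASES.index(codon[1])
                 + BASES.index(codon[2]))
        amino_acid = AMINO_ACIDS[index]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three display blocks of the gene sequence above.
first_blocks = "ATGAATCCAA CCATAACAAC CAGCAGTTTT"
print(translate(first_blocks))  # MNPTITTSSF
```

The translated prefix matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the GC content field.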