Gene Phep_1454 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_1454 
Symbol 
ID8252555 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp1725232 
End bp1726710 
Gene Length1479 bp 
Protein Length492 aa 
Translation table11 
GC content43% 
IMG OID644935108 
Productglycosidase PH1107-related 
Protein accessionYP_003091730 
Protein GI255531358 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2152] Predicted glycosylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.719751 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.786023 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGATTAT TAATAGAACG AAAACCAGTT AAAGTATACC CAGATCCTAA ACGGGTAATT 
GCAAGATTTT TTTTTAACGG AGATGAAAGA GCTGTAGAAG TGGTGAGGCA TGTACTGGCC
CTTTCGGATA AAGATGTATT CGGACTGATC TCGCCTTTGT TACAGGAATA TTCCAAGAGG
CACCGGAACA TCACCAAAAT ACTTGCCCGT CACTGTAAAA AGATCAAACA CTGTATAGAA
ACGGCGGGTT ACGATTTTGA ATCGCTTGAT AAATATCAGA AGTTGCTGCT TGGATCCTAT
TTCACACATG AATATTCTAT AGAATCTGCA GCTTTTTTTA ACCCTTCGGT AGTAGAAGAC
CCTGATCAGT CCGACCTGGT TGAAGGTGAA AAAAGGCTGA TCATCAGTTT CAGGGCAGTA
GGTGAGGGGC ACATTTCGTC TGTCGTGTTC CGTCGTGCAC TGATAGACCG TAACCATAAC
ATTACGGTAA TCCCGGTAGG CAATTATATT GATGAAGCGG AGATCATTAA GAACGCGGTC
TATAATAAAA AACTATTTCT TAAAAAAGCG GCTGATTCCC GCATAGATAT AGAGGTTTTG
GATGAGGTAG GTGTTAAACT GGAGGATAAA TTTGATTATG CCACACTCAG AAAGATCATT
CTTGACAGCA AAGGTCTGCA GGAAGATGAC CTTAAAAAAC TGGAATATGA CAAGATCCTC
TGGTTGTCGG ATACCTACCA TGAGATCAGC TTTTCGAGGG ACACCGATAT TTCTGACCGG
GTGATTTTCC CCATCTCTGA ATTTGAGCGC AAGGGTATAG AAGATGCCAG GTTTGTCCGT
TTTGTAAAAG ATGATGGCAG CATTATTTTT TATGCCACCT ATACTGCTTT TGATGGGGCC
ATGATTATGC CCAAGCTATT GCAGACCACG GATTTTTACG ATTTTAAGAT CAGTCCGCTA
CATGGTATTG GTGCGCAGAA TAAAAATCTG GCACTTTTTC CACGTAAGAT CAACGGCAAA
TATGCCATGA TGTCGCGCAT AGATGGCTGG AACAACTACC TAATGTATTC CGATAAGCTT
ACGGTATGGG ATAACCCTGT AAAGCTGCAG GCACCACAGT TTCCATGGGA ATTCATACAG
ATCGGTAATT GTGGTTCGCC TATAGAAACT GAGGCCGGCT GGCTGGTGAT TACCCATGGT
GTAGGGCCAA TGCGCCGTTA TTGCCTGGGA GCCAGTTTGT TTGATCTGAA TGATCCTTCA
ATAGAAATAG GAAGGTTAAA TGAGCCACTG GTTATTCCCA ATACAGATGA AAGGGAAGGG
TATGTACCTA ATGTGTTGTA TTCCTGTGGT TCGATCATTC ACGATGGTGA ACTGATCATT
CCCTACGGCC TGTCGGATTA TTGTTCTTCC TTTGCCACGG TAAAAATAGC ACTGTTACTG
GAAAAACTGG AAAGCACTGA CTTATATCGT CCTGCATAG
 
Protein sequence
MRLLIERKPV KVYPDPKRVI ARFFFNGDER AVEVVRHVLA LSDKDVFGLI SPLLQEYSKR 
HRNITKILAR HCKKIKHCIE TAGYDFESLD KYQKLLLGSY FTHEYSIESA AFFNPSVVED
PDQSDLVEGE KRLIISFRAV GEGHISSVVF RRALIDRNHN ITVIPVGNYI DEAEIIKNAV
YNKKLFLKKA ADSRIDIEVL DEVGVKLEDK FDYATLRKII LDSKGLQEDD LKKLEYDKIL
WLSDTYHEIS FSRDTDISDR VIFPISEFER KGIEDARFVR FVKDDGSIIF YATYTAFDGA
MIMPKLLQTT DFYDFKISPL HGIGAQNKNL ALFPRKINGK YAMMSRIDGW NNYLMYSDKL
TVWDNPVKLQ APQFPWEFIQ IGNCGSPIET EAGWLVITHG VGPMRRYCLG ASLFDLNDPS
IEIGRLNEPL VIPNTDEREG YVPNVLYSCG SIIHDGELII PYGLSDYCSS FATVKIALLL
EKLESTDLYR PA