Gene Phep_3883 details


Gene Information

Locus tag: Phep_3883
Symbol:
ID: 8255017
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pedobacter heparinus DSM 2366
Kingdom: Bacteria
Replicon accession: NC_013061
Strand:
Start bp: 4669219
End bp: 4670493
Gene Length: 1275 bp
Protein Length: 424 aa
Translation table: 11
GC content: 48%
IMG OID: 644937547
Product: Xylose isomerase domain protein TIM barrel
Protein accession: YP_003094136
Protein GI: 255533764
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG4952] Predicted sugar isomerase
TIGRFAM ID: [TIGR02629] L-rhamnose catabolism isomerase, Pseudomonas stutzeri subtype
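
The coordinates and lengths in this record agree with each other, and a short calculation shows how. The sketch below is illustrative only. It assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The record does not state either convention explicitly, and the variable names are made up for this example.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4669219
end_bp = 4670493

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and adds no amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")     # 1275 bp
print(protein_length, "aa")  # 424 aa
```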


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 53
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATATTAG AGAACTATCA GATAGATTCA CATAACAATG AGCTGTTGGC TAAACATCAG 
CGTAAACTTA CCTTTGTTAA AGATGAGCTT GCCAATGCAG AGGAGATCAT ACAGAAACTG
ATTGACTTCC AGATCGGGAT TCCCAGCTGG GCATTGGGCA CAGGGGGTAC ACGTTTCGGA
CGTTTCTCCT CGGGTGGTGA GCCCCGCAAC ATCGAAGAAA AGATTGAAGA CATCGGCCTG
CTGCATAAAT TAAATGCAGC AAGCGGAACC ATATCCCTGC ACATTCCATG GGATACGGCT
GATAATGTAC CGGCTATAAA AGCTTTGGTA GCACAGCATG GCCTGGGCTT TGATGCCATG
AACTCGAATA CCTTTCAGGA CCAGGCCAAC CAGGCGCACA GCTATAAATT CGGTTCCTTA
CAGCATGTAA ACAAGGCGGT AAGAAAACAG GCCATTGAAC ACAACATTGA AGTGATCCGT
CAGGGTGTGG AGCTGGGCTC TAAGGCACTT ACGGTATGGC TGGCAGATGG CTCCTGTTTC
CCCGGACAGC TGAACTTCAG GAAAGCCTTC CAGAACACAT TGGAAAGCCT GCAGGAAATT
TATGCAGCTT TGCCGGCCGA CTGGAAAGTA TTTGTAGAAT ATAAGGCTTT TGAGCCCAAC
TTCTATTCTA CTACCGTGGG CGATTGGGGG CAGTCCTTAT TGTACGCCAG TAAACTAGGC
CCTAAAGCCT TTACACTGGT CGACCTTGGT CACCACCTGC CAAACGCCAA CATAGAGCAG
ATCGTTTCTT TATTGCTGAT GGAAGGAAAG CTGGGTGGGT TCCATTTCAA CGATTCCAAA
TACGGCGATG ACGACTTAAC GGCAGGTAGC ATGAAGCCTT ACCAGCTGTT CCTGATCTTT
AATGAACTGG TGGAAGGAAT GGATGCACGT GGTATGGACC ACAGGAAAGA CCTGGGCTGG
ATGATCGATG CCTCACATAA TGTTAAAGAC CCCTTAGAAG ATCTTTTACA ATCGGTTGAA
GCCATTATGC TGGCCTATGC GCAGGCACTG TCAGTAGACC GTAAAGCATT GCAGCAGGCC
CAGGAAGAAA ATGATGTGGT ACGTGCCCAG GAAATATTAC AACAGGCATT CCGTACCGAT
CTGCGACCTT TGGTAGCAGA AGCACGTTTG CGTGCAGGTG CAGCCCTGGC TCCGCTTGAG
CTGTTTAGAA ATAGCAAGGT AAGACAAGAA CTGATAAAAA GCCGTGGTTT AAAAACTGTA
GCAACTGGAT TATAA
 
Protein sequence
MILENYQIDS HNNELLAKHQ RKLTFVKDEL ANAEEIIQKL IDFQIGIPSW ALGTGGTRFG 
RFSSGGEPRN IEEKIEDIGL LHKLNAASGT ISLHIPWDTA DNVPAIKALV AQHGLGFDAM
NSNTFQDQAN QAHSYKFGSL QHVNKAVRKQ AIEHNIEVIR QGVELGSKAL TVWLADGSCF
PGQLNFRKAF QNTLESLQEI YAALPADWKV FVEYKAFEPN FYSTTVGDWG QSLLYASKLG
PKAFTLVDLG HHLPNANIEQ IVSLLLMEGK LGGFHFNDSK YGDDDLTAGS MKPYQLFLIF
NELVEGMDAR GMDHRKDLGW MIDASHNVKD PLEDLLQSVE AIMLAYAQAL SVDRKALQQA
QEENDVVRAQ EILQQAFRTD LRPLVAEARL RAGAALAPLE LFRNSKVRQE LIKSRGLKTV
ATGL
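
The gene and protein sequences above can be checked against each other by translating the nucleotide sequence. The following is a minimal sketch, not the method used by the database. Translation table 11 assigns the same amino acids to codons as the standard genetic code and differs only in which start codons are permitted. Because this gene starts with ATG, the standard assignments are assumed to be enough here. The helper names `clean`, `translate`, and `gc_content` are hypothetical and written for this example.

```python
from itertools import product

# Standard codon assignments in NCBI "TCAG" order (shared by table 11).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to group bases; uppercase."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate DNA from the first base, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(seq):
    """Return the percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence, copied from this record.
first_line = "ATGATATTAG AGAACTATCA GATAGATTCA CATAACAATG AGCTGTTGGC TAAACATCAG"
print(translate(first_line))  # MILENYQIDSHNNELLAKHQ
```

The translated fragment matches the first 20 residues of the protein sequence listed above. Passing the full gene sequence to `translate` and `gc_content` is a way to compare the result with the Protein Length and GC content fields.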