Gene Phep_3884 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_3884 
Symbol 
ID8255018 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp4670505 
End bp4672625 
Gene Length2121 bp 
Protein Length706 aa 
Translation table11 
GC content47% 
IMG OID644937548 
Productshort chain dehydrogenase 
Protein accessionYP_003094137 
Protein GI255533765 
COG category[S] Function unknown 
COG ID[COG3347] Uncharacterized conserved protein 
TIGRFAM ID[TIGR02632] rhamnulose-1-phosphate aldolase/alcohol dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones48 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTACTC GAACAACCGA TTTTAAACAT GTAAGCTATT TGTGGGATGA TGCAAAGGCT 
GCTGAAATGG CCGGGGATGA AGTAGCATTG TTGATCTACC GTTCAAATTT ACTGGGTGCA
GATTTGCGCT TAACCAATTA CGGTGGAGGT AACACCTCCT GCAAGGCCAT GGCCAAAGAC
CCGCTTACCG GAGAAGAAAA AGAGGTAATG TGGGTAAAAG GGTCTGGCGG CGATTTGGGT
ACCTTAAAGA AAAGCGGCCT GGCCGCTTTA TATGTTGACC GCCTGCGCAA CCTTAAAAAT
GTTTACCGGG GTTTGGCATA CGAAGATGAA ATGGTGGAAT TGTTTAACCA TTGCATTTAC
GATCTGGACA GTAAAGCGCC TTCTATTGAC ACCCCTTTAC ATGGCTTTTT GCCTTTTAAA
CATATTGATC ACCTGCATCC CGATGCAGCC ATCGCTATTG CAGCTGCAAA AGATGGCGAA
CGCATTACCA AAGAATTGTT TAACGGAACA ATTGGCTGGG TGCCATGGCA AAAACCTGGT
TTTGAGCTGG GGCTGATGCT TCGTAAATGC CTGGACGAAA ACCCTGGCAT CAGGGGCATT
ATGTTAGGTT CACATGGTTT ATTTACCTGG GGCGATACAG CTTACGAAAG CTATATGAAT
ACCCTTGAAG TAATTGAAAG GTGTGCTGCA TACCTGGATG AAAATTATGG TAAAAAAGGT
CCGGTATTTG GCGGACAGAA AATTGAAAGT GCCGAAAAAA CACAACGTGC AAAACAGGCC
GCAGCTATTG CCCCTGTTTT AAGGGGGTTC TGCTCGAGCG AAAGAAACAT GATCGGGCAC
TTTACCGATG ATGCCAGGGT ACTGGAATTT ATCAATTCAA ATGACCTGGA CCGTCTGGCC
CCTTTGGGTA CCAGCTGTCC TGACCATTTC CTGAGGACTA AGATCAGCCC TCTGGTATTG
AAACTGGATG CCAGTGAAGA CCTGAGTGAT GTGTCCTCTC TGAAAGAAAG GCTGGCCCCT
GAATTTGAGG CTTACCGTAA AATGTATACC GAATATTACA ATTCGTGCAA ACATGCCAAT
AGCCCGGCCA TACGTGATGC CAACCCGGTC ATCATTTTAT ATCCGGGCAT CGGGATGTTC
TCGTTTTCAA AAGATAAACA AACGGCAAGG GTGGCTGCAG AATTCTACAC CAACGCCATC
AATGTAATGA AAGGCGCCGA AGCCATTTCT GAATACACTT CTTTACCGAG ACAGGAAGCG
TTTAACATTG AATATTGGCT GCTGGAGGAA GCAAAGCTGC AGCGCATGCC TAAGCCGAAA
GCCTTGTCGG GAAAAATAGC CCTGATTACC GGAAGCGCAG GCGGGATTGG TAGAGCAATT
GCTAAAAAGT TTGTAGAAGA AGGTGCTGTA GTGGTACTGA ACGATATGAA CGCAGAACGC
CTGGAAGGTG CTGCTGAAGA GTTCAAAAAC AAATATGGTA AAGACAGTTA TGCCACTGCC
TTGCTGAATG TAACCAGTGC TGAAGACATT AACGCTGCAT TTGATGCTGC TGCGCTTGCT
TTTGGCGGGG TAGATATTAT TGTGAACAAT GCAGGCCTTT CTATTTCGAA AACCATAGCA
GACCATACCG AGAAGGATTG GGACCTGTTG TATGACGTAT TGGTAAAAGG CCAGTTTTTC
ATTACCCAGG CTGCTGCAGC GGTAATGAAA AAACAGGATA TCGGTGGTGA TATCCTGAAC
ATTGTAAGCA AAAACGCATT GGTAAGCGGG CCAAACAATG CGGGTTATGG CAGTGCCAAA
GCAGCTCAGC TGCACTTAAG CAGGTTAAAC GCTGCCGAAC TTGGTGCAGA TGGCATTCGT
GTAAACGTGG TTAACCCTGA TGCAGTGATC AGCGACAGCA ATATCTGGGC TGGTGGCTGG
GCCGAAGGCC GGGCAAAAGC TTACGGAATT ACCGTGGCAG AGTTGCCAGC TTACTATGCA
AAACGTACCT TGCTGAATGA AATAATTTTA CCGGACGATA TAGCCAATGC CTGCTTCGCT
TTTGTAGGCG GCCTGCTCAA TAAATCAACC GGAAATGTAT TAAATGTGGA TGGTGGTGTA
GCCAACGCCT TTGTCCGTTA A
 
Protein sequence
MSTRTTDFKH VSYLWDDAKA AEMAGDEVAL LIYRSNLLGA DLRLTNYGGG NTSCKAMAKD 
PLTGEEKEVM WVKGSGGDLG TLKKSGLAAL YVDRLRNLKN VYRGLAYEDE MVELFNHCIY
DLDSKAPSID TPLHGFLPFK HIDHLHPDAA IAIAAAKDGE RITKELFNGT IGWVPWQKPG
FELGLMLRKC LDENPGIRGI MLGSHGLFTW GDTAYESYMN TLEVIERCAA YLDENYGKKG
PVFGGQKIES AEKTQRAKQA AAIAPVLRGF CSSERNMIGH FTDDARVLEF INSNDLDRLA
PLGTSCPDHF LRTKISPLVL KLDASEDLSD VSSLKERLAP EFEAYRKMYT EYYNSCKHAN
SPAIRDANPV IILYPGIGMF SFSKDKQTAR VAAEFYTNAI NVMKGAEAIS EYTSLPRQEA
FNIEYWLLEE AKLQRMPKPK ALSGKIALIT GSAGGIGRAI AKKFVEEGAV VVLNDMNAER
LEGAAEEFKN KYGKDSYATA LLNVTSAEDI NAAFDAAALA FGGVDIIVNN AGLSISKTIA
DHTEKDWDLL YDVLVKGQFF ITQAAAAVMK KQDIGGDILN IVSKNALVSG PNNAGYGSAK
AAQLHLSRLN AAELGADGIR VNVVNPDAVI SDSNIWAGGW AEGRAKAYGI TVAELPAYYA
KRTLLNEIIL PDDIANACFA FVGGLLNKST GNVLNVDGGV ANAFVR