Gene Phep_0129 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_0129 
Symbol 
ID8251214 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp154802 
End bp155851 
Gene Length1050 bp 
Protein Length349 aa 
Translation table11 
GC content44% 
IMG OID644933779 
Producthelix-hairpin-helix motif protein 
Protein accessionYP_003090417 
Protein GI255530045 
COG category[L] Replication, recombination and repair 
COG ID[COG1194] A/G-specific DNA glycosylase 
TIGRFAM ID[TIGR01084] A/G-specific adenine glycosylase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTTTTC AATCCGAAAT CGTAAACTGG TACTTAAACC ATAAAAGGGA CCTACCCTGG 
CGGGGTACTA CAGATGCATA CATCATCTGG TTATCCGAAG TGATCCTGCA GCAAACCCGG
GTAGACCAGG GCCTGCCCTA TTTCAATAAT TTTTTACAGA ACTATCCAAC TGTGCTCGAC
TTTGCCAGCG CCAGTGAAAC ACAGGTGCTT AAACTATGGC AGGGGCTTGG CTATTATTCC
AGAGGCAGGA ACATGCTTTT TACAGCCAGG CAGGTGCGCG ACCTGCATGG TGGCGTTTTT
CCGGTGCGCT ATGATCAGCT CATTAAACTA AAAGGGATTG GCGAGTATAC CGCCGCTGCC
ATTGCTTCTT TTTCTTCAAA TGAATCAAAA GCAGTGCTCG ATGGCAATGT GTTCCGTGTA
TTGTCCCGTT ATTTCGGTAT AGAAAGCCCC ATAAACAGCA GCACCGGTAA AAAACAGTTC
GCAGACCTGG CCCAGTCACT CATCAGTGGC CAGCAGCCTT CAGTATACAA TCAGGCCATT
ATGGAGTTCG GCGCATTGCA GTGCAAACCC AAATCTCCCA ATTGTGGTAT ATGTCCGGTT
CAGGACAGCT GTTTTGCTCA AAAGCATCAT CTGGTCGGTA CGCTTCCTGT AAAATTGAAC
AAACTGAAAA AACGTACCCG CTATTTCAAC TATTTCCTGT GTATGGAAGG CGACAATATC
CTGGTCAAAA AAAGGAGCCC TGGCGATATA TGGCAGGAAT TATATGATTT TCCACTCATC
GAAACAGACC GGCCTTTTCT GGAAGATCCC GAAAAGTTTG CCCCTTTGTT ACAGGAAAGC
TTTGGCGCAG CTTGTAAAGT CAGGACTTTA TCCCATCAAA AACATTTATT AACACACCAA
ACTATATATG TTCAATTTTT TGGTTTAGAT AATTATATCA TTAACTTTAA TCAGAATGCA
GAAATAAAAT GGGTCTCATT GCCGGAATTC GACGAATTGC CGCAACCTAA AGTGATCACC
AATTTTGTTT GTAAGCATTT TATTACATAG
 
Protein sequence
MSFQSEIVNW YLNHKRDLPW RGTTDAYIIW LSEVILQQTR VDQGLPYFNN FLQNYPTVLD 
FASASETQVL KLWQGLGYYS RGRNMLFTAR QVRDLHGGVF PVRYDQLIKL KGIGEYTAAA
IASFSSNESK AVLDGNVFRV LSRYFGIESP INSSTGKKQF ADLAQSLISG QQPSVYNQAI
MEFGALQCKP KSPNCGICPV QDSCFAQKHH LVGTLPVKLN KLKKRTRYFN YFLCMEGDNI
LVKKRSPGDI WQELYDFPLI ETDRPFLEDP EKFAPLLQES FGAACKVRTL SHQKHLLTHQ
TIYVQFFGLD NYIINFNQNA EIKWVSLPEF DELPQPKVIT NFVCKHFIT