Gene Phep_0001 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_0001 
Symbol 
ID8255424 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp
End bp1260 
Gene Length1254 bp 
Protein Length417 aa 
Translation table11 
GC content44% 
IMG OID644933650 
ProductPDZ/DHR/GLGF domain protein 
Protein accessionYP_003090289 
Protein GI255529917 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0793] Periplasmic protease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones47 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGTTGT TAATGAGCAG GATTAATTTC GGTAAACTTA TCTTATTCCA GCTTATCCTG 
GCCTTTCAGA TCGGCATCGC AATTCCTGCC CGGGCACAGC AGTTCCATTT CACAGAAAAC
AAGATCAAGG ATGCTATCCG GTTTACGATG GTAAAGAACC TGGTGATCAT CCCAGTTTAC
CTGAACGGCC AGGGGCCCTT TAATTTTATT CTAGACACCG GCGTTGGCCC GCTGATCATC
ACCGATCCGG CATTGATTGA TAGTCTTGGC CTGAAGGTTT CACGTTCCAT TAAAATCTCG
GGCCTGGGAA AGGGGGAAGA AATTGATGCA TTTGTTTCAA ACGAAATCTC TGCAAACGTG
GGGAAGGCGG CGATTGAGCA TATCCCTGCA GCGATCTTAA AAAAGGATAT TTTCGGTTTA
TCCAATTACC TGGGCACAAA GATCTACGGA CTGCTGGGGT ATTATTTTTT TAAAAGTTTT
TTGGTACAGA TCAGGTACTC CCAAAAAAGG TTATTGTTTA GCTACCCTGC TGCGGTCCGT
AAAATCAAGG GAGAAAGGGT GTCCATTGAA ATTGTAGGTT ATAAACCTTA TGTCAATATT
GACATGGAGA CCGCTGACCA TAAAAATATA ACGATAAAAA TGCTGGTAGA CAACGGGGCA
AGTCATGCCA TATCTTTGGA AAGGCTCAAT GAGCGCCCTT TCCCCGTGCC TTCAACGTCT
ATTCCCGCCA GTCTTGGTGT AGGCATGAGC GGGCCTATCG ACGGCAGCAT TGGCCGGATC
CCTTCCTTAA CCATTGGCGG GGTTGTGCTC AAAAATATTT TGGCATCTTA CCCCCACTAC
GATGATGTAG CTGCCAAAGT GATACTGAAA GACCGCAATG GTAATTTAGG GGCCGATGTA
TTGAGCCGTT TTAACATCAC TTTTGATTAC GAAGACAATT CTGTATACCT GAAAAAGAAC
AGCCTGTTTA ACCGGCCTTT CGAACATGAC ATGTCGGGCA TAGAGGTATA CCTTCAGGAG
GGCAAGAAAA AACATTATTT CATCAGCAGG ATCGAACCCG GGTCACCAGC CGAAAAAGCG
GGGATACAGG TAGATGACGA GCTCATTTCC ATCAATCTTA TGCCTGTTAT GAGTTATAAA
CTGGATGAAA TCAGCAACCT GTTCAAAGCT GCCGACGGCA AACAGATGAT CCTCACCATT
GTAAGGAACA ATGAGCTGAT GATTAAGGTT TTTAAGCTTA AACAAAGAAT TTAA
 
Protein sequence
MMLLMSRINF GKLILFQLIL AFQIGIAIPA RAQQFHFTEN KIKDAIRFTM VKNLVIIPVY 
LNGQGPFNFI LDTGVGPLII TDPALIDSLG LKVSRSIKIS GLGKGEEIDA FVSNEISANV
GKAAIEHIPA AILKKDIFGL SNYLGTKIYG LLGYYFFKSF LVQIRYSQKR LLFSYPAAVR
KIKGERVSIE IVGYKPYVNI DMETADHKNI TIKMLVDNGA SHAISLERLN ERPFPVPSTS
IPASLGVGMS GPIDGSIGRI PSLTIGGVVL KNILASYPHY DDVAAKVILK DRNGNLGADV
LSRFNITFDY EDNSVYLKKN SLFNRPFEHD MSGIEVYLQE GKKKHYFISR IEPGSPAEKA
GIQVDDELIS INLMPVMSYK LDEISNLFKA ADGKQMILTI VRNNELMIKV FKLKQRI