Gene Phep_3080 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_3080 
Symbol 
ID8254197 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp3678912 
End bp3680153 
Gene Length1242 bp 
Protein Length413 aa 
Translation table11 
GC content42% 
IMG OID644936733 
ProductATP-dependent Clp protease, ATP-binding subunit ClpX 
Protein accessionYP_003093339 
Protein GI255532967 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1219] ATP-dependent protease Clp, ATPase subunit 
TIGRFAM ID[TIGR00382] endopeptidase Clp ATP-binding regulatory subunit (clpX) 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.01313 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTAAAC AAAACAAAGA ATCCCGTTGC TCTTTTTGCG GCTCAGGTAA GCAGGATACA 
CTAATGCTTA TTGAAGGACT GGATGCATTT ATCTGCGATA AGTGTGTAAC CCAGGCCAAT
CAGCTGCTGG TACAGGAGCT GGGGAGTAAA AAATCGAAAG CTTTAGATAC TTCGATCACG
CTGTTAAAGC CCCTGGAAAT TAAAGCTCAT ATTGACCAGT ATGTAATTGG TCAGGATGAT
GCTAAAAAGG TGCTTGCAGT GGCGGTATAT AACCACTATA AAAGGTTAAG TCAGAAAGTA
GACAAGGGTG ATGAGGTTGA GATTGAAAAA TCCAATATCA TGTTGGTAGG TGAAACGGGT
ACAGGTAAAA CCTTACTGGC TAAAACTATT GCCAAGATAT TACATGTACC TTTCTGTATA
TGTGATGCAA CGGTACTTAC AGAGGCTGGG TATGTTGGTG AAGATGTGGA GAGCATTCTT
ACCCGATTAT TACAGGCTGC TGATTATGAC GTGGCTTCGG CAGAACGTGG CATTGTATAT
ATTGATGAGG TAGATAAGGT GGCACGTAAA AGTGATAATC CTTCTATTAC CCGGGATGTA
TCTGGTGAAG GCGTACAGCA GGCTTTACTG AAGATATTAG AAGGTACGGT AGTAAACGTT
CCACCACAGG GCGGACGTAA ACATCCTGAT CAGAAGATGA TCCCGGTAAA TACAAATAAC
ATTCTGTTTA TATGCGGCGG GGCTTTTGAT GGCATAGAAC GTAAAATTGC CAACAGGCTG
CGTACACAGG CAGTAGGTTA TAAGGTTAAA AAGGACGACG CTGAACTGGA TCTTAAAAAC
CTTTATAAAT ATATTACGCC TCAGGATTTA AAATCGTTTG GTTTAATTCC GGAACTGATT
GGACGTGTGC CGGTTTTGAC CCACCTGAAC CCATTGGATA AGCAGGCATT ACGCAACATC
CTGACCGAGC CTAAAAACTC GCTGTTCCGT CAGTATGTAA AATTGTTTGA ACTGGAAAAT
GTGAAACTTA CATTTGATAA CGAAGTTTTG GACTTTATAG TAGATAAAGC GATGGAATAT
AAGCTTGGTG CAAGGGGCCT GCGCTCTATT TGTGAGGCCA TTATGCTGGA TGCGATGTTC
GAGATCCCTT CTGATACCAG TGTCAAGGAG TTGAGCATTA CACTCGATTA TGCGGTTGAA
AAGTTTGAGA AGGCCGACTT TAAAAAGTTA AAAGCTGCTT AG
 
Protein sequence
MAKQNKESRC SFCGSGKQDT LMLIEGLDAF ICDKCVTQAN QLLVQELGSK KSKALDTSIT 
LLKPLEIKAH IDQYVIGQDD AKKVLAVAVY NHYKRLSQKV DKGDEVEIEK SNIMLVGETG
TGKTLLAKTI AKILHVPFCI CDATVLTEAG YVGEDVESIL TRLLQAADYD VASAERGIVY
IDEVDKVARK SDNPSITRDV SGEGVQQALL KILEGTVVNV PPQGGRKHPD QKMIPVNTNN
ILFICGGAFD GIERKIANRL RTQAVGYKVK KDDAELDLKN LYKYITPQDL KSFGLIPELI
GRVPVLTHLN PLDKQALRNI LTEPKNSLFR QYVKLFELEN VKLTFDNEVL DFIVDKAMEY
KLGARGLRSI CEAIMLDAMF EIPSDTSVKE LSITLDYAVE KFEKADFKKL KAA