Gene ECD_00389 details

Gene Information

Locus tag: ECD_00389
Symbol: clpX
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli BL21(DE3)
Kingdom: Bacteria
Replicon accession: CP001509
Strand:
Start bp: 425110
End bp: 426384
Gene Length: 1275 bp
Protein Length: 424 aa
Translation table: 11
GC content: 52%
IMG OID:
Product: ATP-dependent protease ATP-binding subunit
Protein accession: ACT42288
Protein GI: 253976618
COG category:
COG ID:
TIGRFAM ID:
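
The start, end, gene length and protein length fields above are related by simple arithmetic. The sketch below shows that relationship, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon. The record does not state either convention explicitly, and the function names are illustrative only.

```python
# Values copied from the gene information fields.
START_BP = 425110
END_BP = 426384


def gene_length_bp(start, end):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_aa(gene_length):
    """Residue count for a CDS whose last codon is assumed to be a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(START_BP, END_BP)
print(length, protein_length_aa(length))  # prints: 1275 424
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.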


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.612848
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACAGATA AACGCAAAGA TGGCTCAGGC AAATTGCTGT ATTGCTCTTT TTGCGGCAAA 
AGCCAGCATG AAGTGCGCAA GCTGATTGCC GGTCCATCCG TGTATATCTG CGACGAATGT
GTTGATTTAT GTAACGACAT CATTCGCGAA GAGATTAAAG AAGTTGCACC GCATCGTGAA
CGCAGTGCGC TACCGACGCC GCATGAAATT CGCAACCACC TGGACGATTA CGTTATCGGC
CAGGAACAGG CGAAAAAAGT GCTGGCGGTC GCGGTATACA ACCATTACAA ACGTCTGCGC
AACGGCGATA CCAGCAATGG CGTCGAGTTG GGCAAAAGTA ACATTCTGCT GATCGGTCCG
ACCGGTTCCG GTAAAACGCT GCTGGCTGAA ACGCTGGCGC GCCTGCTGGA TGTTCCGTTC
ACCATGGCCG ACGCGACTAC ACTGACCGAA GCCGGTTATG TGGGTGAAGA CGTTGAAAAC
ATCATTCAGA AGCTGTTGCA GAAATGCGAC TACGATGTCC AGAAAGCACA GCGTGGTATT
GTCTACATCG ATGAAATCGA CAAGATTTCT CGTAAGTCAG ACAACCCGTC CATTACCCGA
GACGTTTCCG GTGAAGGCGT ACAGCAGGCA CTGTTGAAAC TGATCGAAGG TACGGTAGCT
GCTGTTCCAC CGCAAGGTGG GCGTAAACAT CCGCAGCAGG AATTCTTGCA GGTTGATACC
TCTAAGATCC TGTTTATTTG TGGCGGTGCG TTTGCCGGTC TGGATAAAGT GATTTCCCAC
CGTGTAGAAA CCGGCTCCGG CATTGGTTTT GGCGCGACGG TAAAAGCGAA GTCCGACAAA
GCAAGCGAAG GCGAGCTGCT GGCGCAGGTT GAACCGGAAG ATCTGATCAA GTTTGGTCTT
ATCCCTGAGT TTATTGGTCG TCTGCCGGTT GTCGCAACGT TGAATGAACT GAGCGAAGAA
GCTCTGATTC AGATCCTCAA AGAGCCGAAA AACGCCCTGA CCAAGCAGTA TCAGGCGCTG
TTTAATCTGG AAGGCGTGGA TCTGGAATTC CGTGACGAGG CGCTGGATGC TATCGCTAAG
AAAGCGATGG CGCGTAAAAC CGGTGCCCGT GGCCTGCGTT CCATCGTAGA AGCCGCACTG
CTCGATACCA TGTACGATCT GCCGTCCATG GAAGACGTCG AAAAAGTGGT TATCGACGAG
TCGGTAATTG ATGGTCAAAG CAAACCGTTG CTGATTTATG GCAAGCCGGA AGCGCAACAG
GCATCTGGTG AATAA
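
The gene sequence is printed in blocks of 10 bases, so the spacing has to be removed before any length or composition check. The following is a minimal sketch of how a GC content figure could be recomputed from such text. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and how the source database rounds its reported percentage is not stated.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(text):
    """Fraction of bases that are G or C (0.0 to 1.0)."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# Usage: paste the full gene sequence between the triple quotes.
# Only the first line of the record is shown here for brevity.
gene_text = """
ATGACAGATA AACGCAAAGA TGGCTCAGGC AAATTGCTGT ATTGCTCTTT TTGCGGCAAA
"""
print(len(clean_sequence(gene_text)), "bases")
print(f"GC content: {gc_fraction(gene_text):.0%}")
```

With the complete sequence pasted in, the base count can be compared against the Gene Length field and the percentage against the GC content field.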
 
Protein sequence
MTDKRKDGSG KLLYCSFCGK SQHEVRKLIA GPSVYICDEC VDLCNDIIRE EIKEVAPHRE 
RSALPTPHEI RNHLDDYVIG QEQAKKVLAV AVYNHYKRLR NGDTSNGVEL GKSNILLIGP
TGSGKTLLAE TLARLLDVPF TMADATTLTE AGYVGEDVEN IIQKLLQKCD YDVQKAQRGI
VYIDEIDKIS RKSDNPSITR DVSGEGVQQA LLKLIEGTVA AVPPQGGRKH PQQEFLQVDT
SKILFICGGA FAGLDKVISH RVETGSGIGF GATVKAKSDK ASEGELLAQV EPEDLIKFGL
IPEFIGRLPV VATLNELSEE ALIQILKEPK NALTKQYQAL FNLEGVDLEF RDEALDAIAK
KAMARKTGAR GLRSIVEAAL LDTMYDLPSM EDVEKVVIDE SVIDGQSKPL LIYGKPEAQQ
ASGE
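
The protein sequence can be related back to the gene sequence by translating it codon by codon. The sketch below builds the codon table from the NCBI layout, in which bases are ordered T, C, A, G. Translation table 11 assigns the same amino acids to codons as the standard code and differs in which codons may act as starts. The `translate` function is illustrative and is not the method used by the source database.

```python
from itertools import product

# Codon table in NCBI order (first base varies slowest, bases ordered TCAG).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(text):
    """Translate block-formatted DNA, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()
    residues = []
    # Ignore any incomplete codon at the end of the input.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence.
print(translate("ATGACAGATA AACGCAAAGA TGGCTCAGGC"))  # prints: MTDKRKDGSG
# Last line of the gene sequence, which ends with the stop codon TAA.
print(translate("GCATCTGGTG AATAA"))  # prints: ASGE
```

Both results match the corresponding ends of the protein sequence listed above.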